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ABSTRACT 

The first part of this thesis considers ttie complexity of Boolean 
functions. The main complexity measures used are the number of gates in 
combinational networks and the size of Boolean formulas. The case of 
monotone realizations, using only the operations of AND and OR, of monotone 
functions is emphasized. 

For a particular class of monotone functions, the quadratic functions, 

the worst-case values for the monotone circuit complexity is shown to be 

o 
proportional to n /log n. The number of A-gates necessary to compute any 

quadrastic function is also analyzed. 

A technique for deriving bounds on monotone circuit size of threshold 

functions is applied to the "majority" function (threshold n/2) to establish 

a lower hound on its monotone circuit complexity of 3n-0(l). For the function 

"threshold 2", previously known lower bounds on the number of v-gates required 

are extended in the case of a circuit which has a minimal number of A-gates. 
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As a result, it follows that no monotone circuit for "threshold 2" can 
simultaneously have both a minimal number of A-gates and a minimal number 
of v-gates. 

The complexity of combinations of functions on disjoint sets of variables 
is studied, and a gap between the formula and circuit size of a particular 
function is given. 

Finally, we study the effect of allowing negation in a formula for mono- 
tonic functions. Examples are given both of functions in which using negations 
allows more succinct expressions, and functions in which it does not. 

The second part of the thesis describes an algorithm for computing shortest 
paths in a graph. These results show that an algorithm originally proposed 
by Spira for this problem can have slow running time. The lacuna in his 
algorithm is repaired, and it is shown to have 0(n 2 (log n) 2 ) average running 
time over wide classes of graphs as Spira originally claimed. As a special 
case, a transitive closure algorithm with 0(n 2 log n) average time is also 
described. 
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CHAPTER 1 
Introduction 



This thesis consists of two parts, teach devoted to a different 
area in the theory of computation. It is primarily devoted to a 
study of the complexity of Boolean functions, with an emphasis on 
monotone Boolean functions. A secondary objective is the analysis 
of algorithms which compute the shortest distances between all points 
in a graph and which compute the transitive closure of a Boolean 
matrix. 

In general, computational complexity theory asks questions 
about the computational resources required to solve a problem or 
compute a function. Traditional complexity theory has been pri- 
marily concerned with the difficulty of computing recursive functions 
on various models of machines such as Turing Machines or Random 
Access Machines; measures such as time or space on these machines 
are considered. Finite functions (i.e. those with finite domain), 
since they are computable by finite state machines, are inherently 
"easy" according to these definitions since they require only enough time 
and space to read the input and print the output. Several models 
of computing machines for finite functions have been proposed over 
the past years, and these have been studied as a means for defining 
the complexity of finite functions. Some of these, such as 
circuit complexity, receive their motivation from computer technol- 



\, 
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ogy, while others such as formula size are more mathematical in 
origin. In the work which has been done so far, these models are 
seen to have a rich mathematical structure which is only now begin- 
ning to be understood to any extent. The bulk of the research con- 
tained in this thesis; is a study of several problems in this area 
of finite computation. 

We restrict ourselves to the Boolean case in which the finite 
domain and range are vector spaces over tbe set {0,1}. The original 
work in this area was directed towards finding asymptotic results 
about the worst-case complexity of all Boolean functions on n 
variables. Work of Shannon [1949], Lupanov [1958,1962], and others 
[Krichevskii 1961] established that "most" Boolean functions on 
n variables have minimal formulas of size asymptotic with 2 n /log n, + 
and minimal circuits of size asymptotic with 2 n /n. During the 
1950' s attention was focused more on individual "practical" problems, 
and on finding optimal or near-optimal canonical formulas and 
circuits for a function with a given input-output specification. 
Work of Quine [1952,1955] and others [McCluskey 1956, Karnaugh 1953] 
explored methods for finding efficient ways of computing or expres- 
sing a specific function. 

Recent research has been directed primarily at finding exact 
values for the complexity of individual functions in different 
models of complexity. As an adjunct to this pursuit, other more 



In this report, all logarithms are to the base 2. 
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basic questions such as the relationship between complexity in 
different models have been studied. This search for optimal 
circuits and formulas is only indirectly motivated from an engi- 
neering point of view , but is relevant to complexity theory. 

Several authors [Pippenger and Fischer 1977, Schnorr 1976d] 
have demonstrated a connection between the Turing Machine complexity 
of a function and the circuit complexity of finite restrictions of 
that function. These results essentially shew that a function is 
easy to compute with respect to oracle Turing Machine time if and 
only if it has a small circuit on each set of finite length inputs. 
This work relates to recent studies of non-deterministic polyno- 
mial time (MP) and to the WP-complete problems of Cook [1971] and 
Karp [1972] because it implies that if P = NP t then the finite 
restriction of any problem in np has a polynomial -si zed circuit. 
While this latter question is obviously still open V this observation 
has given impetus to many researchers to explore the possibility of 
proving large lower bounds on the complexity of specific Boolean 
functions. 



t 

Considerations such as the number of gates in a circuit or the 

delay time are not as important as other factors such as the inter- 
connections between gates and the fan-out of the gates. 
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The results obtained thus fair have not been very encouraging 

in pursuit of this goal. The lower bounds obtained may be divided 

into two types. Diagonal-type arguments can obtain exponential 

lower bounds on the complexity of specific functions. For example, 

Stockmeyer [1974] and Meyer have shown that, since there are functions 

which are recognizable on a deterministic Turing Machine in space 2 n 

whose restrictions to inputs of fixed length have exponential circuit 

complexity, one can get a lower bound of c n on any circuit which 

computes a restriction of certain logical languages. Thus far, 

results of. this sort have been limited to Boolean functions which 

are binary codings of problems which are provably difficult to compute 

on a Turing Machine; the functions involved have not arisen as 

Boolean functions in their own right. Moreover, since the WP-complete 

problems are not yet provably difficult for a Turing Machine, these 

techniques do not apply to them. 

Direct arguments for particular functions have not yielded much 
in the way of lower bounds. For single*ot*tput functions, no lower 
bounds on circuit complexity larger than linear in the number of 
inputs have yet been proven. 

The order of these bounds has not been improved even when the 
allowable gates in the circuit are highly restricted. For the 
complete basis of all binary Boolean operations, the largest of 
these lower bounds is 2.5n recently discovered by Paul [1977] 
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and Stockmeyer [1977]. f For restricted bases, the largest known lower 
bound is 7n for particular functions [Redfcin 19731. Work contained in 
this thesis establishes lower bounds on the monotone circuit complexity 
of a particular threshold function which is equal to the largest lower 
bound achieved for this basis (namely 3n). 

For multi -output functions, larger lower bounds have been achieved 
on the circuit complexity in certain bases by exploiting the interplay 
between the functions computed by different outputs. For example, 
Paterson [1975] and Mehlnom and Galil [1976], Indepently extending earlier 

work of Pratt [1974], have shown that the monotone circuit complexity of 

3 2 tt 
the multiplication of two nxn matrices is 2n -n . Several other 

researchers have used different techniques to obtain lower bounds propor- 
tional to n log n on functions related to sorting problems [Lamagna and 
Savage 1974, Lamagna 1975]. 

We observe that in both cases the bounds obtained are far short of 
the exponential lower bounds theoretically possible for most functions. 



f Recently, Lipton and Tarjon [1977] exhibited larger bounds for 
planar circuits, and Schnorr [1978b] has announced a 3n lower bound for 
arbitrary circuits. 

++ Recently, these techniques have been extended to yield an n(n /log n) 
lower bound on the monotone circuit complexity of certain functions 
[Wegener 1978] 
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For formulas the results are also limited. If arbitrary con- 
nectives are allowed in the formula, the largest lower bounds are 
given by a general technique of Nec'iporuk C 1 966] which he uses to 
establish lower bounds proportional to n /log n for a specific 
function. Harper and Savage use Nec'iporuk 's method to establish 
lower bounds on the complexity of the marriage problem [1972]. 
This technique is used later in this thesis to exhibit a monotone 
function for which negations don't allow a reduction in the formula 
size by more than a constant factor. Other lower bounds for 
formulas in which arbitrary binary connectives are allowed have 
been reported by several authors [Hansel 1964, Hades and Specker 
1968, Vilfan 1972, and Fischer, Meyer and Paterson 1975]. For more 
restricted sets of connectives, Khrapchenko [1971] established 
lower bounds on the {a,v,-i} formula complexity for the parity 
function proportional to n 2 , and this is the largest polynomial 
bound yet established for formula complexity. 

The principal research contained in this thesis is the estab- 
lishment of lower bounds on the complexity of specific Boolean 
functions, but includes more general work on asymptotic bounds on 
the complexity classes of Boolean functions and on the relation- 
ship between several complexity measures. In particular, we begin 
by studying the monotone circuit complexity of quadratic monotone 
Boolean functions. (Those for which each prime impl leant is a 
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product of two distinct variables.) Using siople counting techniques 

2 t 
we show that most quadratic functions require at least fl(n /log n) 

gates to compute in any circuit. Cti the other hand, we show that 

2 

a monotone circuit of size proportional to n /log n exists for 

every quadratic function. When counting individual types of gates 
in a monotone circuit, we show that n-1 A-gates suffice to compute 
any quadratic function, and exhibit a specific function for which 
2n/3 A-gates are necessary 1n any monotone circuit. 

In chapter 4 we examine the question of whether it may be 
easier to compute combinations of several functions than to compute 
the individual functions and then combine them. Both monotone 
and general circuits and formulas are considered. As a corollary 
to the work reported there, we establish the existence of a function 
of polynomial complexity which has smaller circuit complexity than 
formula complexity — in fact their ratio is proportional to 
n/log log n. 

Chapter 5 contains an examination of the monotone circuit 
complexity of the threshold functions. Using techniques similar 
to Paul [1977], Stockmeyer [1977], and Schnorr [1974], we establish 
larger lower bounds on their complexity than previously known. 
These bounds are in some cases quite small in comparison with the 
best known upper bounds. In particular, for the function 
"threshold n/2" of n variables, we establish a lower bound of 



t 
For notation, see page 51. 
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3n-7 gates necessary in any monotone circuit; on the Other hand, the 

■ 2 
smallest known circuit has a nuanber of gates proportional to n(log n) . 

For the function "threshold 2" of n variables, we present exact 
bounds on the number of A-gates and v-gates for any monotone 
circuit. These bounds were observed by F.F. Yao + t We extend 
these bounds in the case of a circuit with the minimal number of 
A-gates for values of n which are a power of 2, and show that for 
these values of n it is impossible to simultaneously achieve the 
minimum number of both types of gates. The latter result was again 
previously observed by F.F. Yao. 

Finally, we study the effect of different bases on the formula 
complexity of monotone functions. In work done jointly with M. 
Paterson, we exhibit a function with monotone formula complexity 
proportional to n /log n for which any formula in an arbitrary 
basis can be no more than a constant factor smaller. On the other 
hand, in work done jointly with A. Meyer, we show that there are 
monotone Boolean functions for which the smallest monotone formula 
is larger by a factor of e(n) than a formula for the function in 
the basis of all binary operators. 



Becently these techniques have been extended by the author to yield a 3n-0(l) 
lower bound on the {a, v, -.} complexity of this function. 



tt 

Personal communication, 1975. 
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The final part of this thesis deals with the problems of 
determining the transitive closure of a Boolean matrix and deter- 
mining the shortest distances between all points of a non-negatively 
weighted graph. P.M. Spira [1973] has published an algorithm to 
find the shortest distance matrix which he claims has an average 
running time (over a large class of weighted directed graphs) of 

2 2 

0(n log n). Research by A. Meyer, M.J. Fischer, and this author 

has demonstrated classes for which his algorithm has slow average 

running time. A revision of his algorithm does indeed have 

2 2 
0(n log n) average time, over even wider classes of graphs than 

Spira claimed. This result is presented in Chapter 7; it is also 

2 

shown there that a further revision yields a simple 0(n log n) 

average time algorithm to compute the Boolean transitive closure 
over a wide class of probability distributions on matrices. 

We begin by presenting some introductory remarks on Boolean 
functions. 
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CHAPTER 2 

Definitions and Preliminary Results 

Section A. Boolean Functions 

An n-inp ut, m-output Boolean function is a function with 

domain {0,l} n and range {0,l} m . We will denote the set of all 

n-input, m-output Boolean functions by B . We will generally be 

interested in one-output Boolean functions and denote the set B n 1 of 

all such functions with n inputs as B . If feB n , we usually 

n n ,in 

denote an arbitrary element of the domain of f by I s (x, .Xg.. .. »x n ), 

and refer to each x. for 1 s i <, n as a variable of f . Some 

specific Boolean functions we will refer to are the unary function 

negation or NOT (denoted -i ) , and the binary functions disjunction 

or OR (v) , conjunction or AND (a or • ) + , EXCLUSIVE OR or mod 2 sum 

(© ), equivalence (=h and the constant functions 0_ and ]_. 

The sets of functions B have been extensively studied from 

n,m 

a mathematical point of view [Harrison 1965, MacLane and Birkhoff 
1967J. We assume that the reader is familiar with elementary 
properties of these sets. We use notation which is standard and 
refer the reader to Harrison [1965] for elaboration and proofs of 



t 

Frequently we will denote the conjunction of variables x and y 

by their concatenation xy. 
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results that we only state . 



Boolean functions may be combined in standard fashion. In 

particular the composition of functions is defined as usual.. If 

f and g are members of B for some n, we Jdefifle fvgeB n by 

(fvg)(x) = f(x)vg(x) for any te{0,l} n . If a collection of functions 

(f. | iel) is each a member of B n , the disjunction ^jf 1 cB n is 

defined in the obvious fashion. The conjunction, of functions is 

defined similarly. In addition, we may define, from the functions 

f e B « m and geB„ m , the function fxgeB_ .._. _ +_ by 
n, ,m, 3 n^jm^ n-j+n2,m^+m2 



fxg( 



x 1 ,x 2 ,...,x n .y^yg,...^ ) = (f(x 1 ,...,x n ), g(y 1 y n )) 



m, m. 



iii-i mo 
(where we use the obvious isomorphism between {0,1} x{Q,l} and 

{0,1} ' c ). The combination v(fxg) is defined, for feB n 



'1 



+ Examination of the properties of Boolean functions indicates 
strong similarities among them. In fact, they may be grouped into 
pairs in which occurrences of a and v are interchanged, as well as 
occurrences of and 1. Such statements are called duals of each 
other. It is a well known fact that a Boolean statement fs true if 
and only if its dual is true. This is known as the pricipal of 
duality , and permits some economy in proving properties of Boolean 
functions. 
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and geB , to be the obvious composition of v and x ; that is 
n 2 

v(f*g)(x T ,...,x n ,y r ...,y n ) = f(x ] x n )vg(y ] y n ). 

A(fxg) is defined similarly. 

Two properties of these combinations which we will use are 

stated in the following lemma: 

2.1 Lemma : Suppose f and g are members of B for some n s 0. 
Then (a) if fvg = 0, the constant function 0, then f = and g = 

and (b) if fAg = j_, then f = 1_ and g = 1_. 

One specific set of Boolean functions is the set of projection 

functions . The i projection function of n variables, 

denoted nj, is defined by n"(x ] ,. ., ,x n ) = x j for each i in 

{l,2,...,n}, If f is an arbitrary member of B ^, we denote 

n,m 

by f.. the function njof, the i th component of f. 

We say that a function feB functionally depends on its 
i variable x i if there are constants c and t in {0,1 } n 
which differ only in their i th positions for which f(c) t f(cf). 
Finally, if A is some set of input variables for f, and 
c x e{f),l} is a constant defined for each xeA, then we denote the 
restricted function of the remaining variables which is obtained 
from f by setting each variable x in A to c by 

f I 

'x = c for xeA ' 

A 

We are particularly interested in one subclass of Boolean 
functions, the monotone Boolean functions (m.b.f.'s). Observe that 
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the set {0,1}" is partially ordered by the rule 

(x 1 ,...,x n ) s (y 1 »...»y n ) iff x i is y i for i = l,2,...,n 
(where Oil). 



The set B is partially ordered (in fact is a Boolean algebra) 

n.tn r 

by defining, for f.QeB^, 

f 5 g iff f(x) s g(x) for all x e {0,l} n . 



A Boolean function feB n,ra is monotonic increasing , or 
monotone for short, if and only if it preserves the partial 
ordering s; that is, if f(x) s f(y) for every pair of inputs x,ye{0,l} n 
for which x s y. 

Specific examples of m.b.f.'s include the function 
threshold Jc of n variables, denoted TJJ, which is defined by 

jP(x, ,...,x ) =1 if and only if at least k of the variables 

( x, ,...,x ) have the value 1. The threshold 1 function is the 

disjunction x,vx <J v...vx ; the threshold n function is the 

conjunction x,ax 2 a...ax . Another m.b.f. is the Boolean 

multiplication of two n*n matrices A and B, a function in 

n 
B , , which is defined by A«B = C where C.. = v (A..aB. .) 
2n Z, n d • n k*l 1K * KJ 

for each i and j in {1,2,. ..,n}. A third set of m.b.f.'s 
is Boolean convolution, in which we define feB n 2 n _j by 
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f k Oc ,...,x n _ 1 ,y ,...,y n _ 1 ) = ^^(x^y.) 

for each * k s 2n-2, 

The m.b.f. 's satisfy several properties which are not true of 
Boolean functions in general. The following facts are easily 
verified. 

2.2 Lemma : Suppose f and g e B n are monotone Boolean functions. 

Then 

(a) if fAg = 0, then f = or g = 0. 

and (b) if fvg = J_, then f = j_ or g = 1_. 

Further properties of the m.b.f. 's will be explored later in 
this chapter. It is useful at this point to point out one 
property of the ordering relationship on B . 

2.3 Lemma : Suppose that f, g, and h are functions in B . If 

n 

f <. h and g < h, then 

(1) fAh = f 

(2) fvh = h 

and (3) fAg < f < fvg < h. + 



We remark that the dual of an arbitrary function feB is the function D(f) in 

B n defined by D(f)(x) = -f(.-ix 1 , -tX 2 ,...,-h x ( J. The truth of a statement 

(cont. on next page) 
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Section B. Representations of Boolean Functions 

There are many different ways of representing Boolean functions, 
some of which give rise to the complexity measures discussed in this 
thesis. One standard representation is the truth table of the value 
of the function on different inputs. The unifying representation 
used in this thesis is that of a combinational (or gate-type) 
switching circuit , and is identical with those studied by numerous 
authors [Harrison 1965, Savage 1976, Paul 1977, Schnorr 1974 and 

others]. The underlying structure is a graph, which we now define. 
A directed graph D consists of a pair of finite sets (V,E), 
where V is a set whose members are called nodes or vertices , and 
E c VxV is a set whose members are called edges or arcs. Another 
notation for V and E is NODES(D) and EDGES(D) respectively. 
If (v,w)e EDGES(D) we say there is an edge from v to w in D. 
A path or chain of length Jc in D is a sequence of nodes 
v Q ,v, v k (for k ^ 0) of D such that there is an edge from 

v. to v. +1 for every 1£{0,1 ,...,k-l}. A graph D is acyclic 
if there is no path from any node A of D to itself other than 



like Lemma 2.3 is unchanged if every function i$ replaced by its 
dual. 

Observe that if f s g, then D(f) * D(g), and that the dual 
of a monotone function is monotone. 
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the trivial path of length 0. For any node A of an acyclic graph 

D, we define the following subsets of nodes of D which pertain to 

A: 

Succ(A.D) = {BeNODES(D) | there is an edge from A to B} is 
the set of immediate successors of A, 

Succ*(A,D) = {B£N0DES(D) | there is a path from A to B in D}, 

Succ + (A,D) = {BeNODES(D) 1 there is an path of length k il 

from A to B}, 

Pred(A.D) = {BeNODES(D) | there is an edge from B to A} is 
the set of immediate predecessors of A, 

Pred*(A,D) = {BeNODES(D) | there is a path from B to A in D}, 

and Pred + (A,D) = {B e N0DES(D) | there is a path of length k & 1 

from B to A}. 

If W is some subset of nodes of D, we may extend the above 

devinitions to W and speak, for example, of the set of immediate 

successors of W defined by Succ(W^D) = USucc(A.D). Finally, 

A e W 

the i ndegre e or fan -in (respectively outdegree or fan-out) of a 

node A is the number of edges directed into (out of) A in D 

and is denoted indeg(A,D) (outdeg(A.D)). In general, we will omit 

mention of the graph D in the above notation when the graph is 

clear from context. 
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Suppose n is a finite set of primitive functions 9j«B 

called the basis . A combinational circuit or network over the 
basis n, in short an n-circuit , is a directed acyclic graph N 
together with a labelling of the nodes and edges of N which is 
subject to the following constraints and definitions: 

(1) For some positive integer n, at most n of the nodes 
with indegree zero are given distinct labels from the 

set {x, ,x 2 ».. . ,x }; we refer to such nodes as input nodes 
and denote their set by INPUTS(N). 

(2) There may be one node of indegree zero labelled with the 
symbol ZERO, and there nay be one node of indegree zero 
labelled with the symbol ONE; such nodes are referred to 
as constant nodes . 

(3) There are no other nodes of indegree zero other than those 
mentioned in (1) or (2). 

and (4) Each node with indegree one or more is labelled with some 
member g. of fi. Each such node is a gate node , and 
their set is denoted GATES(N). Such a node G labelled 
with g. must have indegree 1n N equal to the arity 
n. of g.; furthermore, each edge directed into 6 must 
be labelled with one of the integers {1,2,...,^} in 
such a way that every edge into G gets a different label. 
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An example of such a network is given in Figure 2.1, where the 
basis is the set {a,v,-i }. (For figures, the arcs in the graph are 
always directed downward. Arbitrary nodes are designated by triangles, 
constants or inputs by rectangles, and gates by circles). 




Fig. 2.1 A Combinational Circuit 
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If the input nodes to an n-circuit N are labelled with members 
of the set {x, ,...,x >, we may associate a Boolean function in B 
with every node A In K by the following inductive rule. We denote 
the associated function by Res(A,N). If A is an input node 
labelled with the variable x., then Res(A,N) is the projection on 
the i coordinate, i.e. the function iq. If A is the constant 
node ZERO (respectively ONE), then 1 RESj(A,N) =0, the constant 
function in B equal to on all inputs (respectively RES(A,N) = 1, 
the constant function in B n equal to 1^). Final lyv suppose A is 
a gate for which every node in Pred(A) has an associated function. 
If we let B k be the node in Pred(A)sucJ» that the edge from B. 
to A is labelled with k, and if A is labelled with the basis 
element g.eft, then 

Res(A.N) = g 1 (Res(B l ,N),Res(B 2 ,N) Res(B n ,N)). 

We will say that an n-circuit N computes a function feB R m 

iff for each i, 1 <s i s m, there is a node A^ of l« such that 

Res(A.,N) = f. , the i th component of f. For example, the 

circuit given in Figure 2.1 computes the function ffx^x^x-j) ■ 

x, & x 2 flai x 3 since the gate with outdegree computes this function. 

We may also say that the circuit computes the function 

g(x, ,x 2 ,x 3 ) = (x, , x,a-, x 2 ) since there are gates which compute 
each component of g\. A node which computes a component of the 
designated function will be called an output node. 
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A basis a is called semi -complete iff given any function feB 

for arbitrary n, there is an n-circuit as described above which 

computes f. The basis is called complete if any f can be computed 

by an n - circuit which does not contain the nodes ZERO and 

ONE (i.e. constants are not available). Three complete bases 

in which we will be interested are B 2 , the set of all binary 

Boolean functions, B. = BjU B 2 , and U = {A,vft}. An example 

of a semi-complete basis is the set to, a} . An exhaustive 

characterization of all complete bases has been made by Post [1941], 

and the interested reader is referred to this work. 

One incomplete basis which holds special interest is the 

set M = {a,v} which is important for the following reason. 

2.4 Theorem : A Boolean function feB is monotone iff there 

n,m 

is an M-circuit which computes f. 

For a proof of Theorem 2.4 the reader is referred to [Harrison 
1965 p. 189]. 

The bulk of this thesis is concerned with the results about 
M-circuits (also called monotone circuits) computing monotone 
Boolean functions. 

One special class of circuits which holds special interest 
is the set of Boolean formulas. An fl-formula is an n-circuit 
in which each node in the circuit has outdegree at most one, and 
which has a unique node of fan-out zero corresponding to the unique 
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output node. For a formula, we remove the restriction on the 
circuit that there be only a single node for each input variable 
and constant, and allow any number of nodes of indegree zero 
labelled with any input variable or constant, each node having 
outdegree one. For an example see Figure 2.2. The graph of any 
a- formula is a tree in which each leaf node (one with indegree 
zero) is labelled with a variable or constant, and each interior 




Fig. 2.2 The Formula (x, v Xg) a f-ixj v x 3 ) 



node (every other node) is labelled with a gate. This formulation 
is easily seen to be equivalent to the usual inductive definition 
of an ft-formula. Clearly a formula can compute only a single-output 
function. 
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Classically, Boolean formulas have been extensively studied 
as a means of representing Boolean functions. Many formula schemes 
have been proposed as canonical means for their representation, 
including the classical disjunctive normal form. In this method, 
a function feB is represented as the formula 

C l C 9 c « 

(c««.....c )eA. 

.- ( x if j = 1 
where x J a \ 

( nx if j = 

for any variable x, and A f is those set of constants ce{0,l} 
for which f(c) = 1. Other canonical forms are discussed in 
[Savage 1976 J. 

If F is a formula, and H is a node in F, then F H will 
denote the subformula of F above Hj that is, Pred (H,F). 

In this thesis, we will occasionally replace part of one 
circuit by another circuit and speak of cofflbinations of circuits. 
For example, suppose that N and N' are a-circuits which contain 
nodes A and A' respectively. The circuit obtained by replacing 
A by_ A^ is constructed by first identifying the corresponding 
input and constant nodes of N and N' and considering them as 
one network. Node A together with each arc directed into it is 
removed from the graph, and each edge (A,B) originally in circuit 
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N 1s replaced by the edge (A'.B) (with the same label as previously). 

For example, see Figure 2.3. The replacement of a node in a circuit N 






F1g. 2.3 The Replacement of A by A' 
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by a node in the same circuit is defined similarly, but substitution 
must not be made in a manner which introduces a cycle into the 
graph. 

One may say little in general about the functions computed by 
a circuit in which one gate has been replaced by another. Specific 
situations may allow some conclusions to be drawn. For example, 
if node A in N is replaced by a node A' in N' for which 
Res(A,N) = Res(A',N')i then it is easy to prove by. induction on the 
length of paths that every remaining node in N computes the 
same function as it did before the substitution. 

One special replacement is the substitution of a constant 
for a variable. We say that N' is the circuit obtained from N 
by setting variable x. to (respectively 1) if N' is obtained 
by replacing variable node x i in • N . by the constant node ZERO 
(respectively ONE). One inductively defines setting a collection 
of variables to a constant. It is again easy to verify the 
following fact. 

2.5 Lemma : Suppose N is an n-circuit, and x.€lNPUTS(N). If 
N' is the circuit obtained by setting variable x. to , then 
for each remaining node A in N' , 



Res(A,r) = Res(A,N)| x = Q 
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A similar statement may be made if x* is set to 1. 

Observe that, depending cm the basis, if a gate in a circuit 

has one input which is constant, then some simplification of the 

circuit may be made. For example, suppose the basis is the set B 2 

of all binary Boolean functions. If a gate has a constant input 
then, except for trivial cases, it may be eliminated from the 
circuit since the output of that function 1s then a unary function 
of the other predecessor. Since this unary function may be 
absorbed into a preceding or succeding binary gate (if one such 
exists), then one may obtain a smaller circuit equivalent to the 
original. See Figure 2.4 for an example. Such simplifications may 




ONE 



Ej] 



Fig. 2.4 Some Simplifications 
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always be made in the basis M. This elimination of gates will be 
a principal tool by which lower bounds on circuit complexity are 
obtained. 

One additional representation for a Boolean function is as a 
set of prime implicants. In the interest of brevity we restrict 
ourselves to monotone Boolean functions, although a similar devel- 
opmen may be made for all Boolean functions. 

A monome or product is a product of distinct variables, e.g. 
X 1 X 2 X 5 ^ s a monome ' In particular, we denote the empty monome 
by t. A monome m includes a monome m' if every variable 
which appears in rrr also appears in m. We denote the function 
defined by a monome m by T(m) , and if M is a set of monomes, 

then T(M) will denote the function VT(m). T(e) is the 

mcM 

constant function 1_. 

Suppose that f is a monotone Boolean function. A monome m 
is an imp! i cant of f iff T(m) <. f; m is said to be a prime 
implicant of f if m includes no other implicant of f. We 
denote the set of all prime implicants of f by PI(f) + . 

ion 



For example, suppose f(x 1 ,x 2 ,x 3 ) is the threshold 2 f unctt< 
of three variables, namely T* . Then x^^ is 

an implicant of f, but is not a prime implicant since the monome 



One defines V^m) = p_. Hence PI(0J - 0, the empty set, and PIO) = { e }. 
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x,Xg is also an implicant of f. PI<f) is the set {x.,x 2 ,. x 2 x 3 » x^}. 

The importance of the set of prime impli cants is that they are 
a canonical representation for the m.b.f. 's, and are fairly easy 
to manipulate mathematically. A straightforward proof (see 
[Harrison 1965 p, 190] for example) -demonstrates the following fact 
of their canonicity. 

2.6 Lemma (Quine): Suppose feB is a monotone Boolean function. 

Then f = V T(m). Moreover, if l> is any set of monomes such 

BtePUn 
that no monome in P, includes another monome in F, and 

f=T(R), then P = PI(f). 



The set of prime impli cants of a combination of functions can 
be obtained from those of the constituent functions by means of the 
following simple set of rules. Suppose that f and g are m.b.f. 's 
in B . Then for any monome m, m 1s an implicant of the function 
fvg if and only if it is an implicant of f or an implicant of g. 



+ 
Note that we equate the monomes x,Xp and x«x, . Actually, the 

set of all monomes is equivalent to the free monoid on' the elements 

{x,,X2,...,x }modulo the relations of the commutati vi ty (xy = yx 

for all variables x,y) and idempotency (xx ■= x for all x). 

c 1s: the identity element for this algebraic system. 
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Moreover, Pl(fvg) is obtained from the union PI(f)uPI(g) by 

eliminating all monomes which include another monome in the union. 

For example, if PI(f Q ) = {x r x 2 x 3 ,x 4 } and PI(g Q ) = {x ] ,x 2> x 3 x 4 }, 

then PI(f vg ) = {Xj.XgiX^. If f and g arem.b.f.'s 1n 

B n , and m is a monome such that mePI(fvg), then mePI(f) or 

m£PI(g), but the converse is not in general true. By Lemma 2.1(a), 

if Pl(fvg) = 0, then both PI(f) and PI(g) are empty sets. 

The set of prime implicants of the function fAg may be 
obtained by forming the set of all products m-n of a monome 
mePI(f) and a monome nePI(g) (where duplicate variables in n 
and m are reduced by the rule x-x = x), and then eliminating 

any products which include other products in the set. For f 
and g Q of the previous example, PI(f Ag ) = {x^x^.x^.x^}. 
Again, ff tePI(fAg), then there are monomes mcPI(f) and nePI(g) 
such that m«n = t, but the converse is not in general true. 

One may relate the ordering relation * on the m.b.f.'s with 
their canonical representation using prime implicants by the following 
general result. 



2.7 Leimia : Suppose f and g fc B n arem.b.f.'s. Then f<g iff 
every monome mePI(f) includes some monome nePI(g). 



Proof : Suppose f sg, and mePI(f). By Lemma 2.6 and 

Lemma 2.3(c), T(m) s V T(n) = f s g, so by the transitivity of <, 

nePI(f) 
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T(m) s g. Hence by definition m is an implicant of g. Define 

m Q to be an implicant of g which 1s Included In m and which 
includes no other implicant of g. Then fflg is a prime implicant 
of g which is included in m. 

Conversely, suppose every monome in PI(f) includes some 
monome in PI(g), and let ce{0,l} n be such that f(c) » 1. By 
Lemma 2.6, there is some raonome me PI (f) such t3iat T( m \(c) * 1- 
By hypothesis, there is a monome n in PI (g) included in m ; 
By Lemma 2.3(c), we know that T(* ) * T(n ). Hence T(n )(t) = 1, 

and since n 1s an implicant of g, g(c) »1 as well . 

D Lemma 2.7. 

One may define classes of Boolean functions in terms of 
structural properties of their set of prime impli cants. For example, 
we will say that a m.b.f. feB n 1s quadratic If every monome in 
PI(f) consists of the product of exactly two distinct variables; 
a m.b.f. f € B n m is quadratic Iff every component f. is 
quadratic (for 1 <; i * m). 

The set of prime implicants of a function has been extensively 
used as a tool in explaining arguments in this field [Paterson 1975 
Mehlhorn and Galil 1976, Laraagna 1975]. This representation is 
particularly helpful in describing several replacements which may be 
made in M-circuita which don't affect the Output function ccnputed. 
Several general theorems have been elucidated by Mehlhorn and 
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Galil [1976] of which special cases have been used by other authors; 

we will find use for one of them later In the thesis. If N is 

any M-circuit, and A is a node in N, then we will denote 

PI(Res(A,N)) by PI(A,N), and as usual omit, mention of N whenever 

the circuit is obvious. 

2.8 Lemma (Mehlhorn and Gal 1.1 : Suppose N is an M-circuit, A is 
any node in N, and PI(A) = {n^.m^.'..,!^}. If there is no 
output node B of N with nu«n'ePI('B) for some monome n, then 
one may replace A by a gate which computes the function 
T(m-j ,m 2> . .. ,m^) without changing the function computed at any 
output gate. 

For a proof of this result and several other general replace- 
ments for M-circuits the reader is referred to the paper [Mehlhorn 
and Galil 1976]. 

One final topic which pertains to circuits in general is the 
different notati' s of dependence. We will say that a node A in 
a circuit N depends functionally on the variable x. if 
Res(A.N) depends functionally on x.. A different notion of 
structural ( path or syntactic) dependence holds if there is a path 
from input node x^ to A in N, i.e. if AeSucc*(x.,N). 
Clearly if A depends functionally on x. then it must also 
depend structurally on x , but in general the two notions do not 
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coincide. It is not known whether they coincide for minimal sized 
circuits over particular bases (M for example). However, it is 
possible to prove the following useful observation. 

2.9 Lemma : Suppose a function feB depends functionally on a 
variable x , and. N is an a -circuit, computing f with 
Res(U.N) = f for a node U in N. Then there is a path from 
Xj to U in N such that every gate in the path depends function- 
ally on x.. 

Proof : We construct the chain backwards from the output gate 
U. Let U = Uq. Inductively assume that we have constructed a 
path U^U^i ,. . . ,Uq in N such that every node in the path 
depends on x... If u" k = x., then the proof 1s complete. Other- 
wise, U^ must be a gate since no other input or constant node 
depends on x^. Hence, since Res(U.) depends on x, , there 
must be a node &ePred(U k ) which depends functionally on x^. 
We extend the path by defining . u" k+1 * B. (See Figure 2*5) 

Since N contains no cycles, the nodes selected are all 
distinct. Since there are only a finite number of nodes In N, 
the above process must eventually halt. 

Lemma 2.9 
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Fig. 2.5 The Extension 



2.10 Corollary : If a function feB , which depends on x.. , is not 
the projection function n", and if N is an ^-circuit which computes 
f, then outdeg(x- ,N) > 1. 
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Secti on C. Complexity Measures 

The models of computing Boolean functions introduced in the 
previous section naturally give rise to various complexity measures. 
If n is an arbitrary basis and H is an n-circuit, then we 
define C (N) to be the number? ?fr gate-type nodes in N. If 

feB , then we define the cortbiftattojjal (or c ircuit ) complexity of 

n,m y — c *■ 

f with respect to n, denoted t_(f) f to be the minimum value of 
C (N) for all ft-circuits N which compute f. If f is not 
definable by any n-circuit, weldef Jne^ C (f) = ». 

For formulas, one possible fpmplexity measure is to consider 
the number of gates as was done above for circuits. We instead 
use the number of occurrences of literals as our complexity measure. 
If F 1s an n-formula, we define L_(F) to be the number of occur- 
rences of variable input nodes in the circuit associated with F 
(the number of leaves of the tree corresponding to F which are 
labelled with variables). Observe that if fl is a binary basis, 
and F contains no constant nodes, then LAf) is one more than 
the number of gates in F. The formula complexity of a function 
fcB n *rith respect to n, denoted t- n (f)» is the minimum of the 
values of L„(F) for all n-formul^s* F which define f. Again, 
if f is not definable by an ft-formula, we define L«(f) " °». A 
minimal n-formula (respectively ft-circuit) for f is an n-formula 
F (respectively n-circuit N) which computes f such that 
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L n (F) = L ft (f) (respectively C n (N) ..- C fl (f)). Where there is no 
ambiguity we will use the same notations of complexity L fi for 
formulas and C~ for circuits as well as for functions. 

For the particular monotone basis M, we will use the alternate 
notations MC and ML for C M and L« respectively. For this 
basis, we also define a complexity measure which depends on the 
types of gates used in the circuit or formula. If N is an M- 
circuit, define MC y (N) and MC A (N) to be the number of v-gates 
and A-gates respectively in N. If f is a m.b.f. , we define 
MC y (f) (respectively ML (f)) to be the minimum value of MC V (N) 
as N ranges overall M-circuits (respectively formulas) which 
compute f (where here we consider a formula as a special type of 
circuit). We similarly define MC (f) and ML A (f). An v-minimal 
(respectively A-minimal) M-circuyt for a m.b.f. f is one which 
has the minimal number MC v (f) of v-gates {respectively 
MC A (f) of A-gates). 

The above are the complexity measures considered in the thesis, 
but other measures can be defined. For example, the depth of a 
circuit can be defined to be the length of the longest path in the 
circuit. This measure has been studied by Spira [1971] and more 
recently has been studied by several authors [McColl 1977, 
McColl and Paterson 1977, Paterson and Valiant 1976, and 
Borodin 1977]. One may similarly define the breadth of a circuit; 
this measure has recently been, studied by Schnorr [1976b]. Finally, 
one may choose to consider only circuits in which the outdegree of 
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of each gate in the circuit is bounded above by some integer k. 
Johnson, Savage, and Welch established the resuH that 1f • k a 2, 
such a complexity measure is proportional to C. for any complete 
basis fl, (for proof, see [Savage 1976^.24]). 

The relationship between these complexity measures of an 
individual function is well understood In some 5 circumstances and 
unknown in others. Muller £1956] pointed out that in general, the 
circuit complexity of a function in any complete Basils n is pro- 
portional to its C B complexity, fhisii' true since arty gate in 
B 2 may-be synthesized by an a -subeircult ami vice versa, so one 
can replace a gate by a subcireult W1 tfi only a Constant factor 
increase in the total number of gates*. Fbr formulas, the choice 
of complete basis can make a difference in formula size. For the 
parity function f (x t ,...,x n } = x^*. . .®x ft 4 flirapchenko [T971] 
demonstrated that ^(f^) is proportional to #, while f Q 
clearly has a formula of size n in the basis B«. Pratt [1975] 
has recently shown that the maximal gap between the complexity in 
any pair of binary complete bases canhbt be much larger by demon- 
strating that t^f) * [L B (f)] lp ^3 10 for an y f^. 

The relationship between the B 2 -complex1ty of a monotone 
function and its complexity in the incomplete basis M is not as 

In fact, C B (f) = C fi (f) for any function feB , and if f depends t 
more than one variable, then Cg(f) 1s the minimal nunfcer of binary 
gates in any B-circu1t which computes f. See [Savage 1976 p. 39] 
for a proof of this fact. 



-45- 
well understood. Specific results will be shown in Chapter 6. 

Results about the maximum values of these complexity measures 
among all Boolean functions were mentioned in the introduction. In 
brief, the number of distinct circuits with at most a fixed mumber 
k of gates is bounded above by ck c ' k for some constants c and 
c- depending on the basis. From this fact, one can easily show 
that if there are many different functions to compute , then there 
are not enough small circuits to compute them all, -so "most" of 
them require a large number of gates. What is remarkable is that 
soem clever constructions exist which enable one to compute all 
Boolean functions in certain classes with circuits or formulas 
which are close to, or even asymptotic with, the lower bounds on 
size obtained by the above argument. 

For the class of all Boolean functions, Shannon [1949] and 
Lupanov [1958] have shown that the maximal B 2 circuit complexity 
of any Boolean function f e B n is asymptotically 2 n /n. Similarly, 
it is known [Kricheyskii 1961, Lupanov 1962] that the maximal B, 
formula complexity is proportional to 2 n /log n. 

Recent attention has been focused on the class of monotone 
Boolean functions. Kleitman and Markowsky [1975], extending work 

of numerous authors, have shown that there are asymptotically 

("J 
2 / m.b.f.'s of n variables, and one obtains a correspond- 
ing lower bound of order 2 n /n 3/2 on the maximal B ? (and.'.M) 



'2 
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circuit complexity of m.b*f.*s. Pippenger [1976] has shown that 

this lower bound is attainable in showtng that any m.b.f. f^B 

can be computed in a B 2 -circuit with t$ywpf©tt<:alTy 0(2 n /n 3i ^) 

gates and an M-circuit with asyraptoti dally 0(2 n log fi/fi \ ' j gates. 

Lower bounds on the complexity of specific : functions dominate 

the bulk of this thesis. It will be helpful to point out some 

facts about minimal circuits. If fsB„ ■ and n are arbitrary, 

n»m 

then any minimal n-eircuit for f has a* #ost m gates of outdegree 
0. This is so since any gate of outdegreeO which is not an output 
gate may be eliminated from the circuit without changing the 
functions computed at the output nodes. 

In a similar fashion, one can show that no two gates in a 
minimal n-circuit compute the same function. Suppose S arid G' 
were distinct gates 1n a minimal J) -circuit N for which 
Res(G,N) = Res(G',N). Since H is acyclic, either 6/5ucc (G') 
or G'^Succ (G); w.l.o.g. we assume the, fc^rmef*/ Then we el fminate 
gate G, and replace G by 6' in PredtH.N) for each gate 
H «Succ(G,N). It is easily shown that the resijl ting circuit is 
acyclic, and that every remaining gate In the new circuit computes 
the same function as 1t did originally. This is a contradiction 
since N was a minimal n-circuit. 

Suppose that a function feB depends on at least two variables 
Then any ft-circuit or formula for f rauSt contain at least one 
binary gate (where n is B 2> B, U, or M). By absorbing any 
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unary gates into the binary gates, one cap show that in any minimal 
ft-circuit. for f there are no edges directed out from any constant 
nodes since each gate with one input constant can be simplified 
and eliminated. Similarly, no node 6 has two edges directed into 
the same node H, so outdeg(G) is equaT to the number of nodes in 
Succ(G). 

For minimal formulas, we point out that thenotions of 
structural and functional dependence on a variable x. coincide, 
To prove this, suppose that n is an arbitrary basis, atad F is 
a minimal n-formula for f £ B . If H is a gate in F which 
does not depend functionally on the variable x . , then we claim 

that there are no occurrences of the variable x. in F u . If 

i n 

there were, then one could replace all occurrences of x. in F u 

1 n 
by the constant (or 1) without changing the function, computed 

at H and hence by the entire formula F. Since this reduces the 

number of occurrences of inputs in F, F„ must not contain any 

occurrences of x. . 

Finally, we point out one simple lower bound on the complexity 

of most functions. 



2.11 Lemma (Savage): Suppose a function feB depends on each of 



its variables. Then C„ (f) a. n-1 . 

B 2 
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Proof : Suppose N is a minimal B 2 -circuit for f which 
has k gates. Observe that since each input node has outdegree 
at least one and since there is at most one gate of outdegree 0, 
we have 

'jodes(N) 



I indegree (A,N) = 2k 

AcNc ' ' " 



and I outdegree(A»N) a n-t'k'-l 

AeNodes(N) 



Hence, since 



I indegree(A.N) » 7 outdegree ( A, N) , 

AeNodes(N) AdtodesiN) 



we obtain 2k s n+k-l which establtsh«& thelewna. 

TJ Lemnia 2.tt 



Section D. Circuits and Turing Machines 

One question which naturally arises concerns the relationship 

between the measures of complexity defined above and ordinary 
Turing Machine (T.M.) complexity. As, mentioned above, every finifte 
function is "easy" for a Turing Machine to compute since it may be 
computed by finite state machine. When one considers the finite 
subproblems of an infinite problem, however, the situation is 
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different. For concreteness , consider language recognition 

problems,, i.e. functions f:{0,l}*-* {0,1> (where {0,1}* is. 

the set of all finite-length words over the alphabet {0,1}), and 

let f n be the restriction of f to words of length n. Consider 

the circuit complexity measure C D . 

2 
One can show that there are functions f which are computable 

by a deterministic T.M. in space 2 n (i.e. a relatively easy function) 

for which C g (f n > is near-maximal for all valuesof n (see 

[Stockmeyer 1974] for proof). On the other hand, one can exhibit 

arbitrarily complex, even arbitrarily non-recursive, functions f 

such that C B (f n ) is 1 for every n e 2N. There thus appears to 

be little correlation between the T.M. complexity of a function and 

the circuit complexity of its finite pieces. 

A strong relationship between the two measures comes from a 

result due to Pippenger and Fischer [1977] which asserts the following: 

Suppose f is computable by a T.M. M operating in time T(n) for all n. 

Then there is a constant c (depending on M's state diagram) such that 

C(f n ) £ cT(n)logT(n). The proof is obtained by simulating machine M by 

an "oblivious" T.M. M' which operates in time T-logT - the positions 

of the heads of M' depend only on the input length n. Then it is 

shown that any oblivious T.M. can have its operation on an input of 

length n simulated by a number of gates proportional to its running 

time. 
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This remark has many applications [Paul ISM]. As an example, 
Fischer and Meyer £1971] have shown that Strassen's fast Integer 
matrix multiplication algorithm can be, used to multiply two 
Boolean nxn matrices in time 0(n ; ? logn loglogn legloglogn) 
on a Turing Machine. Using the oblivious construction of a 
circuit simulating the T.M., one thus obtains a B 2 -ctrcuit for flatrix 
multiplication with 0(n 2,82 ) gates, (Recall that Paterson, [1975] v 
and Mehlhorn and Gain [1976] have shown that any M-clrcuit 
for this problem must have a(n 3 ) gates.) 

While Fischer's result is nice as a technical tool, it Is 
obvious by the earlier remarks that the converse result does not 

hold. Schnorr [1976d] has strengthened the result by considering 
oracle T.M.'s In which one work tape initially holds some string 
A in {0,1}*. Define, for f:{Q,l}% {0,1}, 
TC(f)(n) = m1n{|M|.T A (n).log S A (n) | M with oracle A. computes f} 
where U ranges over all oracle T.M.'s (see [Schnorr 1976d] for 
conventions , T A and S A are the time and space required by M, 
and |M| is the number of Instructions in program 11, Then there 
exists some fixed polynomial p and a constant c such that 

C R (f):« c.TC'(f) and TC(f) sp(C R (f)). 
B 2 . B 2 

Section E. Miscellaneous Notations 

Frequently we will be concerned with the rate of growth of a 
numeric function rather than its exact value. Suppose f and g 
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are functions defined on the natural numbers. Then we write 



f ■ 0(g) iff there is a constant c > such that lim sup (f(n)/g(n)) s c 

n-x» 

(I.e. 1f f(n) s c g(n) for all sufficiently large n) f 

f = o(g) iff limit f(o)/g(n) « , 
n+» 

f = n(g) iff there is a constant c > such that lim sup (f(n)/g(n)) a c 

n-H» 

(i.e. if f(n) a c g(n) for all sufficiently large n) s 

f = e(g) iff f = 0(g) and f = n(g), 

f ~ g iff limit f(n)/g(n) =1, 
n-*» 

and 

f ;$ g iff lim sup f(n)/g(n) * 1. 
n*» 

If f = ©(g) we say that f is proportional to g, if 
f ~ g we say that f is asymptotically equal to g, and if 
f £ g we say that f is asymptotically less than g . 

If A is any finite set, then |A| , the cardinality of A, 

is the number of elements in A. A partition of A is a collection 

k 
of pairwise disjoint sets B,,B 2 ,...,B. such that UB. = A. 

If A and B are sets, then A-B ■ {x | xeA and x^B}. 

If X is a set of variables, an X-variable is a member of X. 
N is the set of natural numbers, and p is the set of real 
numbers; R = {reR | r s 0}u{»} 

If kn are natural numbers, the set of consecutive natural 
numbers {k,k+l,k+2 £-"!,£} is denoted [k:*]. 
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If x is a real number, then |_xj is the greatest integer less 

than or equal to x; Hf! is the least integer greater than or eaual 

to x. 

"Iff" is an abbreviation for "if and only if", and "w.l.o.g." 

is an abbreviation for "without loss of generality". 

$ is the empty set. 

k k i 
If k and z are natural numbers, then (■) = , / ' \ , . 
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CHAPTER 3 

Worst-Case Values for the Complexity of Quadratic Functions 

We begin by considering several complexity measures for monotone 
Boolean functions, and examine the complexity of quadratic m.b.f.'s 
under these measures. Recall that a monotone function feB n is 
quadratic iff for each ie[l:m], component f^ of f has a set 
of prime implicants PI(f.j) equal to <x,-x k | {j,k}eA 1 }, where 
A. is some collection of subsets of [l:n] of cardinality two; 
equivalents, f^x,,...,*^) = { jj^ eA Cx j -x |c ). The "threshold 
two" function of n variables t!J is quadratic since 
PI(t£) = *V x k I J.ke[l:n], j f k}. Boolean matrix multiplication 
and Boolean convolution are additional examples. 

In this chapter we primarily consider the measures MC, MC A , 
and MC y for circuits, and corresponding measures ML, ML A , and 
ML for formulas. For any M-circuit N, we have the relationship 
MC (N) + MC (N) = MC(N). For an M-formula F which does not contain 

A . V. 

any constant inputs, we have the corresponding equation 

ML A (F) + ML V (F) = ML(F) - 1 . Hence fof an arbitrary mon otone 

function feB we have the relationships 
n 



The number of gates in an M-formula is one less than the number of 
occurrences of variable and constant inputs. 



and 
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MC A (f ) + HC v (f } * MC(f ) 



ML A (f) + ML v (f) *ML(f) - 1, 



but at least the first Inequality may be strict for specific 
functions f. For example, in Chapter 5 we will show that 
MC.(t3) - Hog ri"l and MC (t!!) = 2n-4, but will also show that 1f 

At V c 

n 1s a power of 2, any M-circuit for Tg which has exactly 
Hog ri~l A-gates must also have at least 2n + 3 Hog if! - 9 
v-gates, and hence any M-circuft for tj" must have at least one 
more gate than the sum MC a (t!|I) + M^(t!J) f *for these values of n. 

Two areas in the complexity of single-output quadratic m.b.f. 's 
are presented in this chapter. The asymptotic value of the largest 
complexity for all quadratic m.b.f. 's 1s explored first. It Is 
shown that every quadratic m.b.f. in B has an M-c1rcu1t 
complexity of 0(n /log n). That this upper bound 1s within a 
constant factor of the best possible result is shown by proving that 
"most" quadratic m.b.f. 's have circuit complexity over the larger 
basis B 2 of n(n /log n). 

The other area is the determination of the number of A-gates 
necessary and sufficient to compute any quadratic m.b.f. 1n B n . 
We show that n-1 A-gates are sufficient in any M-formula, and 
exhibit a quadratic m.b.f. for which {2/3nj A-gates are necessary 
in any M-c1rcu1t, By restricting the types of N-circults allowed, 
we can demonstrate a quadratic function for which n - o(n) A-gates 
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are necessary in any restricted M-circutt, due to an observation by y.Chvatal 

Section A. The Total Number of Gates 

We first demonstrate closely matching Upper and lower bounds 
for the worst-case complexity of quadratic monotone Boolean functions. 
Definition: Suppose neJN . Define 

QC(n) = max{MC(f)|f is a quadratic m.b.f. in B }, 

QC A (n) = max{fC A (f)|f is a quadratic m.b.f. in B }, 

and QC y (n) = max{MC v (f)|f is a quadratic m.b.f. in B n ). 

We similarly define QL(n), QL A (n), and QL y (n). 

We can use standard counting techniques as found in numerous 

proofs [Shannon 1949, Lupunov 1962, Fischer 1974] to establish lower bounds 
on QC. 

3,1 Theorem : Suppose * > 0. Then most quadratic m.b.f. 's f in B ft 
have 

C B ( f ) * - e)n 2 /(4 log n) 

for n sufficiently large, where B 2 is the complete basis of all 
two -input Boolean functions. 

Proof: Let Q n £ B n denote the set of all quadratic m.b.f.'s 
of n variables. Since there are (J) subsets of the 
variables {x ] .x^,. ..x n > with two members, |Q | « 2$ - 1. 

Suppose f e Q n . We know that there is some B 2 -circuit N which 
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computes f since Bp is a complete basis. If N contains two 
gates which compute the same function, then as in Chapter 2, one of 
these may be eliminated to yield a B 2 -circu1t with fewer gates which 
also computes f. Also, if f 1s not a constant function, It is pos- 
sible to construct a Bp-circuitfjor f m wh4dfc jaach constant node 
has outdegree zero. We thus consider only BgNctrcuits in which each 
gate computes a distinct function, and in wh Inconstant nodes have 
outdegree zero ; we cal 1 such circuits reduced. 

Now suppose q e iN and N 1s a reduced B-Tclrcuit of size * q 
which computes a function f«Q . By possibly adding additional 'tiummy " 
gates, we may assume that there 1s a reduced Bg-circuit of size ex- 
actly q in which one of the gates computes f. Let R Q denote the 
number of quadratic m.b.f. *s computed by such reduced circuits of 

size q. Since each gate has two Inputs, and can be labelled with any 

■ .-.:" ■. " "' '■ ■■■■' ' 2 

of the 16 binary Boolean functions, there are at most 16(q + n) 

possible assignments of function-input pair combinations for each 
gate in a reduced B 2 -circuit, giving a total of at most [16(q + n) \J q 
such assignments of reduced Bp-circults. However, many of these la- 
belUngs describe the same circuit -- in fact, there are ql differ- 
ent enumerations of the computational no^des of any jftne4< reduced circuit. 
Hence, there are at most 16 q (q + n) q /q! reduced circuits of size 
q. Since the (quadratic) output function may be computed at any gate of 
the circuit, we have 
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R q * q- 16 q (q + n) 2q /q! 

s (16 q (q + n) 2q+1 -e q )/(^i?q.q q ) (using Stirling's 

approximation ) 

s /(27^(64e)S q+ ^ 2 > (when q * n) " 

* (cq) q . { for some c a and q suf- 

ficiently large ) 



Now, suppose that q s (l-e)(J) / log (!|). Then 

«\,^c(l-«)(«)/log(5)3 0-)(^/ l0 9 <2> 

s ( n, (W)(?) / log (»> (when log( »), c(1 _ s)) 

, 2 (l-e)( 2 n ) . 



Hence the fraction of all quadratic m.b.f. 's counted in R is 

which tends to as n ■■-»■ ». Thus most quadratic functions have B 2 
complexity at least (l-e)ty / log <jj} : *> (l-t)(n 2 )/(4 log n). 

FJ Theorem 3.1 



3.2 Corollary : QC(n) = n(n 2 /log n). 



Proof : For any quadratic nub.f. f, MC(f) a C R (f) since 

B 2 
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M £ B 2 . q Corollary 3.2 

We can prove that alt quadratic, m.b.f.'s can be computed using a 
mpnotone circuit which is no more than a small constant factor larger 
than this bound. The idea of the proof 1s similar to that used in 
the Four Russian's algorithm for transitive closure [Arlazarov et al. 
1970], and has been used previously for constructing efficient cir- 
cuits for other classes of functions [Tarjan 1976]. 

3.3 Theorem ; Suppose f e B n fs a monotone quadratic Boolean function. 
Then 



MC(f) s 4 n 2 /log n + 0(n). 



Proof: Let m ■ (log nj. We partition the input variables of 
f into r n /nP subsets, the 1 th subset S i consisting of the set 
S 1 " tXj|(1-l)jn <js I'm, jii)} for 1 fi £l:n/m]. Each subsfct, with 
the possible exception of the last, has n variables in it. If f«Q n 
1s given, for each input variable x 1 and each subset S, we define 

A 1j " { V S jl k * 1 and x iV PI ^ f ^* 

Note that A^ « for j < n/nfl Let A^ be the Boolean function 
of the disjunction of all variables 1n A^. (recall that by conven- 
tion Ajj 1s the constant function when A- 1 « 0). We clearly 
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can express f as the factorization 

n r n /m"I 

f(x r ,...,x n )= V (y^ ^V) 



(1) 



Our goal will be to compute the functions A., efficiently, and 

I J 

use the decomposition (1) to compute f. We make use of the following 
result [Tarjan 1976] which allows efficient computation of sets of 
disjunctions. 

3.4 Lemma : Suppose X is a set of m variables, and f r f 2 f k 

are monotone Boolean functions which are the disjunction of subsets 
of variables of X. Then there exists a circuit N consisting solely 
of v-gates which computes F = {f r f 2 ,...,f k > such that 
C {V} (N) < 4km/log k . 

Proof of lemma : Let r = Qog _£. We decompose the set of X var- 
iables into r~m/r~l subsets ^ ,T 2 ,... .Tj-.-, each containing no more 
than r variables. For each i, the circuit N initially computes 

the disjunction of all possible subsets of T.. This can be done in a 
manner which has one disjunction for each subset of size * 2 by com- 
puting the disjunction of smaller subsets first, and then constructing 
the disjunction of a larger set with one additional variable by using 
a single v-gate to add this variable to the previously constructed 
disjunction of the smaller set. Hence, this can be performed using at 
most 2 - r - 1 disjunctions for each ie[l:ffi/rT], giving a total of 
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(2 - r - l)(Tm/r~!) < W*m/r~l disjunctions . Since each function 
fj can be constructed by confuting the disjunction of Rn/rl of 
these previously computed functions {one per subset T.), we can com- 
pute F with at most an additional k(Tm/r~l - 1) v-gates. Since 
Tm/rl s 2m/log k, the total number of v-gates used in this construc- 
tion is at most 2kTm/r~l -ks 4km/log k. 

D Lemma 3.4 
Returning to the proof of Theorem 3.3, suppose that j is 
fixed. Note that A^ t # for at most j«m values of i, namely 
for each ie[l:j'm]. Hence the computation* of the functions A., 
for all j.-Hj i values of i can b& accompli shed using the method of 
Lemma 3.4 with at most 4jm /log (jm) v-gates. Thus, one can compute 
all functions A^ for ie[l:n], J€[l:rn/nfTl usingiat most 

Hi/ml ? ,rn/ml * 

s (4jnr)/(log(jm)) » 4rrf ( t j/log(jm)) 
j=l j*l 



4m 2 [n 2 /(2m 2 log n) + 0(n/log 2 n)] 

2 
2n /log n + OfhJ v-gates. 



* 
To sum the series, note that 



rn/nfl 



y j/log (jm) < J 2 n/n Wlog(xm))dx + 2n/(m log n) 

o n 

* (1/m ) j z x/log x dx + 2n/(m log n) 

(cont. on next page) 
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Finally, combining these disjunctions to compute f using (1) 

can be done using an additional n - 1 A-gates (stnce % . = 

nj — 

for all j); the number of additional v -gates is at most 

n n 

I (Tn/nn - n/ml) + n - 1 s r((n-i>/m +1) + n - 1 
1=1 1^1 

n-1 

- ( I S)/m* 2n - 1 

j=0 

- n 2 /2m + 0(n). 

Again, since n /m s 2n /log n, we have a total of 4n /log n + 0(n) 

gates in the M-circuit constructed. 

D Theorem 3.3 

Section B^ Asymptotic Bounds on A-gates 

In the remainder of this chapter, we consider the measures QC 

~'" 1 ■■■<■■■,■• : A 

and QL A which are the minimal number of A-gates required in any 



But 



I x/log x dx ±1 i/log i * f [1/log ^ + ( n . 1+2 )/ log(n-i+2)] 



U/2J+1 
. I 2rn/2">l)/log(n/2 + 1) 

= 2(^1/2^(^/2^-1 })/{2 log (r n /2->l)) 
= n /2 log n + 0(n/log n). 
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clrcult or formula respectively to express * quadratic m.b.f. of 
n variables. We do not consider the measures QC and QL , and 
consider determination of their valuescto be ' ; «Mhtefestfng open 
question. When arbitrary M-circu1ts are allowed, the bounds we obtain 

on A-complexity are not as tight as those Hi\ the preVfbui section, 

but tighter bounds are given in Section C when 

A-gates are not allowed t* have a path to another A-gate 1n the 
circuit. 

3.5 Lemma: Suppose n k 2. Then QC (n) & QL . (n) k Ji-]. 

£roo_f: Obviously QC A (n) s QL^)? sllrrce a formula maybe 
considered a special kind of a circuit. 

We prove that QL A (n) * n-1 by induction on n. 

If n - 2, then clearly QL A (2) = 1 since the only quadratic 
m.b.f. of 2 variables Is f|(x r x 2 ) - X] a x 2 * 

Now assume inductively that QL A (n) s n-1, and suppose that 
f is a quadratic m.b.f. of n+1 v*r4ables. We can express r f 
as the factorization 

f(x T ,...„x n+t ) * (x^ a g(x r ... f x n )) v hCxp...^) 
where g = V{ Xi | x^x^cPKf)} and 
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h = V{ Xl -Xj | Xj-XjCPHf), 1 f n+1, j f n+1} 

Clearly ML A (g) = since it is simply a disjunction of single 
variables. Also ML A (h) s QL(n) since h is a quadratic m.b.f. Of 
n variables. Inductively, we then know that ML (h) s; n-1 , so 



ML(f) s ML A (g) + ML(b) +1 s n. 

A A A 



Lertma 3.5 



Remark: One can show that QL A (4) = 2 by examining all quadratic 
m.b.f. 's of 4 variables, and hence a similar proof demonstrates 
that 0L(n) <. n-2 for n * 4. 

A 

We would now like to determine whether there are quadratic 
m.b.f. 's which require a number of A-gates which is close to the 
upper bound provided in Lemma 3.5. It is easy to demonstrate a 
quadratic m.b.f. in IT for which fl(n) A-gates are required. 
For example, suppose n is even and consider the function 
fg(x,,...,x ) = (x,ax 2 ) v (XjAX*) v ... v (x_-|AX ); the proof that 



t 
We remark that the number of prime implicants of a quadratic m.b.f. 

correlates poorly with its A-complexity since (as we show in Chapter 

5) the function T 2 , which has (!)) prime implicants, only requires 

Hog n~l A-gates to compute. 
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f Q requires n/2 A-gates to compute is relatively straightforward. 
It is an open question whether QC (n) is asymptotic with n. 

A 

Our objective is to show that the constant factor of the trivial 
lower bound of n/2 may be improved to 2/3. 

3.6. Theore m: Suppose n > 2. There is a quadratic m.b.f. feB 
such that MC (f) = [2n/3| . 

Proof : We define f as follows: partition the n variables 
into k = 'n/3 1 subsets S, ,S 2 ,. . . ,S. , each containing 3 variables 
(except possibly S^). We consider the function 

f(x r ...,x n ) = T^S-,) v T^(S 2 ) v ... v T^(S k ) (2) 

where T^ (S.) is the threshold 2 function of the variables in S. 
(for the set S. , if n is not evenly divisible by 3, we consider there 
to be 1 or 2 "dummy" variables set to the constant 0). We 
claim that MC (f) = j2n/3j , and prove this result by induction on 
k in the event n e 0(mod 3). The other cases n = 1 (mod 3) and 
n = 2(mod 3) are proven similarly. We first show that 
MC -(f) > 2n/3 . 

Suppose k = 1 and n = 3. Then f(x,,x 2 ,x 3 ) = T 2 (x, ,x 2 ,x.J . 
By Theorem 5.9 in which it is shown using other means that 
MC (t") = iTog hi , we have MC (f) = ilpg 9 3] = 2. 

Ac. A £ 

Now assume the statement true for k and n = 3k -- namely 
that the function 
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f(x r ...,x n ) - T^S^ v ... vl|(5 k ) 



has MC (f) 2 2k. We are concerned with the function 



f (x r ...,x n ,y r y 2 ,y 3 ) = f(x 1 ,...,x n ) v l|(y 1 .yg^) 

where we have re-named the variables for ease of description. 
Suppose N is an M-circuit computing f. The general 
aim of the proof will be to show that there are constants a, ,a« 
and a, in {0,1} with the following property: if y. is 
set to a. (for i =1,2, and 3), then the resulting circuit N' 
computes f; moreover, it is possible to eliminate two unnecessary 
A-gates from N' since each gate has at least one predecessor 
which computes the constant function 1_. We proceed in a fashion 
similar to Paterson [1976]. 

Definition : Suppose N is a circuit which computes f", and 
suppose i,J£{l,2,3}. We say a node G of N has property P. . 
iff y.j y i is a prime implicant of Res(6,N). 

Definition : If P is any property on nodes of N, we define the 
initial occurrences of P, denoted I(P), to be 

I(P) = {GeNodes(N) | G satisfies P, but no predecessor of G 

satisfies P }. 
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3.7 Lemma : Suppose N is an M*drcuit which computes f", and 
1,J€{1,2,3} , 1 f j. Then 



(D I(P ifj )M 
and (2) I(P, A consists solely of A-gates. 



Proof: Since the input variables do nut satisfy P_. 4 and the 
output gate of N does, by passing up the circuit N from the 
output gate we must eventually find a gate which satisfies P 4 . 
but none of whose predecessors do. This establishes statement 
(1). For statement (2), suppose that G is an v-gate which sat- 
isfies P. , , and let Pred(G.N) = {J,K}. By the remarks in 

Chapter 2 on prime implicants, we know that PI(G*N) c PI(0,N)uPI{K,JN), 
and since y.y.ePI(G,N), one of J or K must satisfy P. <• 
Hence G/I(P. J. 

U Lemma 3.7 

3.8 Lemma : Suppose N,i, and j are as above, and- GdCP, .). If 
Pred(G) = {J,K}, then either y^PIfJ.N) and yjcPI(K.N), or 
vice versa. 

Proof: By Lemma 3.7, If GeI{P. 4 ), then G is an A-gate. 
— — ~ i , j 

Every prime irapTicant of G is a product of a member of PI (J) and 
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a member of PI (K). Since neither PI (J) nor PI(K) contains 
y^Yj. we must have either y^PIfJ) and YfePi(K), or vice versa. 

D tttiina 3.8. 

3.9 Lemma : If N is as above, then l{?^ 2 )uI(P 2 3)^1(^1 3 ) * 2. 

Proof : Suppose to the contrary that there is Only a single gate 
G in HP lt2 )uI(P 2(3 )uI(P 1 j) t . and suppose Pred{G,N) = {J, K} . 
By Lemma 3.8, we may suppose w.l.o.g. that y^'PlGM}' and 
y 2 ePI(K,N) since GeI(P 1 2 ). Again by virtue of Lemma 3.8 and 
the fact that G€l(P 2>3 ), we must have y 3 ePI(JiN-) since 
y 2 ePI(K,N) implies that y 2 ePI(G,N), and hence that y^PUG.N), 
contrary to the assumption thatGeI(P r 2 ). But 1*en G^I(P ] 3 ) 
since y^ cannot be a prime implicant of 

Res(G A N) = (y 1 v y 3 v h ] )A(y 1 vh 2 ). This contradiction implies that the 
union contains at least 2 gates. 

D Lemma 3.9. 



We are now in a position to prove the main theorem. Since 
there are at least two distinct A-gates in I(P ] 2 )uI(P 2 3 )uI(P, 3 ) 
and since none of these sets is empty by Lenrna 3-7, we may w.l.o.g. 
assume that there exist distinct gates G and H in N such that 
GeI(P 1 2 ) and HeI(P 2 3 ). Assume using Lemma 3.8 that 
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Pred(G,N) = {Kj.Kg} and <y 2 ePI(K r N); and similarly that 
Pred(H,f() = {K 3 ,K 4 } and y^PI(K 3 *N>. (It may be possible that 
some of the ..JC images 'are not distinct.) The assignment of the 
variables y ] » 0, y 2 • 1, and y 3 = results 1n a circuit N' 
which computes the function f. Moreover * in N' the inputs K,' 
and K 3 to G and H respectively now compute the constant function 
I, and hence gates G and H may be eliminated fn N" slwce 
I a g * g for any Boolean function g. Hence If the resulting 

circuit is called N", we have 



MC.(N) a MC(N") +2i HC(f) + 2 = 2n + 2 

A A A 



by induction, -and the lower bound is complete. 

To show that C A (f) *. t(2/3)nj , it suffices to observe that 



3 
T 2 (x ,x 2 ,x 3 ) = ((x 1 vx 2 )ax 3 > v (x^AXg) 



3 
and hence C A (T 2 ) s 2. Thus using the definition (2) one can 

compute f using 1(2/3)15 A-gates. 

D Theorem 3.6 



: 3. 10 Corollary : Suppose n ;> 4. Then 

l(2/3)|S . s QC A (n) <; QL A (n) <. n-2 . 



-69- 
Section C. A Graph Problem 

We close this chapter by describing a graph -theoretic problem 
which is related to the A-complexity of quadratic m.b.f.'s when 
they are computed with a restricted class of M-cirCuits. The 
research in the remainder of this chapter was done in conjunction 
with Ronald Rivest. We introduce ndtatfon similar to Harary [1969] 
for undirected graphs. 

Definition: An undirected graph G is a pair (V,E), where V is 
a finite set whose members are called nodes or vertices , and 
E c {B c v ||Bj = 2} is a set whose members are called edges . 

A bipartite graph, is a graph G = (V,E) whose vertices can 
be partitioned into two non-empty sets Vj and V 2 such that if 
edge {v^v^eE, then v^V, ~ v^V,,; that is, each edge consists of 
one point in V 1 and one point in V ? . 

A complete , tri. bjp^rJJJ^. grajih- is a bipartite graph 
(V = V^uVg.E) such that | V 1 1 = k |V 2 | = £, and 
E = {{v r v 2 } | v 1 eV 1 , v 2£ V 2 } . 

Definition: If G == {(y } .EjMY^Eg),... ,(V k ,E k )} is a collection 
of undirected graphs, and G = (V,£) is an undirected graph, then 
an exact covering C of G by G is a collection of one-to-one 
mappings h.:V i •* V for f £ [l:k] such that E = U , {h (e) |e*E.} 
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where h({a,b}) = {h(a),h(b)} for any edge {a.McE- The cost of the 
cover , denoted D(C), 1s k. 




W 




not bipartite bipartite complete 

3-3 bipartite 

Ffg. 3.1 Some Graphs 
Points represent nodes and lines represent edges. 



We can now define a measure on graphs. 

Definition : Suppose neN , and G is an n-node undirected 
graph. -Then 

D(G) = min{0(C) | C is an exact cover of G by {G-j .....G^}, 

where G| 1s » complete bipartite graph 
for 1e[l:kl} . 

Define H(n) ~ max{D(G) I G is an n-node undirected graph}. 
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The connection between quadratic m.b.f. 's and graphs is as 
follows: Suppose f is an n- van able Boolean function. We can 
associate a corresponding n-node undirected graph G f = (V,E), 

where V = {1 ,2 n} and E - {{i,j} | x.x.ePI(f)} . Conversely, 

suppose G = (V,E) is an n-node undirected graph. By possibly 
renaming the vertices, we may assume V = [l:n]. Define the Boolean 
function f G :{0,l} n -^ {0,1} by 

f G (x } x R ) - v fcyXj.| {UiUE} . 

The functions f ■> G f and G ■+ f fi clearly demonstrate a 
one-to-one correspondence between the set of quadratic m.b.f.'s in B 
and the set of n-node undirected graphs with at least one edge. 
See Figure 3.2 for some examples. 




'/-$ "/^ 7 /\ 9 

T2(x 1 ,x 2 ,x 3 ) v T2(x 4 ,x 5 ,x 6 ) v T 2 (x 7 ,x 8> x 9 ) 



Fig. 3.2 Some Functions and their Graphs 
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In fact, we can get a more strict correspondence in mathematical terms. 



Notation : Q n - {f J f 1s a quadratic m.b.f. of n variables} u 
{0, the constant function of n variables}. 
G„ = {6 = (V,E) | G is an sndirected graph, and V * [T:n]}, 
Z n is the graph ([l:n],0) in <? n . 

If G i = (V,Ej) and V* ( V *V e< V defin< * ^l uG 2 t0 
be the graph (V.E^E.,). 

If A is any set, and f:A + B, then f(A) = ff(a) | aeA}. 
If G = (V,E) is an undirected graphv and h:V -^W, 
then h(G) 1s the graph (h(V),h(E)). 

3.11 Lemma : Suppose n * 0. Then 

(1) (Q n »v»0 ) and (G n »u,Z n ) are monoids, 

and (2) the function f -»■ G* is a monoid isomorphism whose 
inverse Is the map G -► fg. 

Proof : Statement (1) is easily checked, as are the facts that 
the maps in (2) are one-to-one, onto, and Inverses of each other. 
Moreover, if fj and f 2 cQ n » then 

PI(f lV f 2 ) = PK^) u PI(f 2 ) 
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since each prime implicant is the product of two variables. Thus, 
if 

\% = (Cl:n] ' E) » 6 f = (n-n].E T ),and G f ■- ({l;n],E 2 ), it 

is easy to check that E = E ? uE 2 , and the claim is verified. 

D Lemma 3.11 

3.12 Corollary: Suppose C = (h r ...,h k ) is a covering of the 
graph G by graphs G ] ,G 2 G k « Then 

V^) v f h 2 (G 2 ) v .» v f hk(6k) . 

Proof: Proveil inductively using Lemma 3.11. 

D Corollary 3.12 



We now connect the complexity of functions and their graphs. 
We will say an M-circuit or M-formula N is single-level if 
there is no path between any pair of A-gates in N. Obviously, any 

quadratic m.b.f. can be computed by a single-level M-formula and 
hence a single-level M-circuit. In a single-level M-circuit N, 
each immediate predecessor of every A-gate consists of the 
disjunction of some set of input and constant nodes. Moreover, if 
N computes a quadratic m.b.f. f, then there is some subset a. of 
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the A-gates of N such that f -^v^^fN). Thws, given a 
single-level M-circuit N computing a quadratic m.b.f. f, one nay 
construct a single-level M-formula F which also computes f and 
which has the same number of A-gates astN since" computing anyr 
number of disjunctions of variables takes no A-gates. 
If f Is a quadratic m.b.f., define 

SLC(f) = minfMC (N) J N is a single-level M-circuit which computes f}, 
We shall prove 



3.13 Theorem : Suppose f is a quadratic m.b.f.. Then SLC (f) = 

| t A 



Proof : Suppose f is a quadratic m.b.f., and that 
D(Gf) . - k. Let C « (^.....h^) be an exact cover of 6 f by 
complete bipartite graphs G, ,...,G. . He will construct a single- 
level M-formula for f which has one A-gate for each graph in 
the cover pf Gf. Suppose G 1 ■ (V^ ■ ^ T gy. 2 ,E r ) i* the decompo- 
sition of the verticesof G| as 1n the definition of bipartite 
graphs. 



Note that h^(G^) is also a complete bipartite graph. Since each 
graph is complete, 
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■-.mxj i jch^v.,)}) A (v { x k i kch^)}), 



Hence, using Corollary 3.12, we can express f 



as 



k 
" & [(Vt.xj I j e h.(v f1 )}) a ( U Xfc | k e h.(V i2 )})] 

which is a formula with k A- ga tes. Thus SLC A (f) .* k. 

Conversely, suppose f Is a quadratic m.b.f., and SLC (f) = k. 
By the remarks preceding the theorem, there is a single-level 
M-formula F for f such that Myf) = k. Since F Is single- 
level, we may write 

k 



F = 1=1 c Va x j° a ( v X k )] 



for some subsets A 1 and B^ of [l:n] for each i e [l:k]. S1 
PI(f) has no single variable terms, we know that A.nB,. = for 
each 1e[l:k]. Thus 

k 
f(x) = i=i {x j x k I j ~ A i and keB i>- 
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So if we define G^ to be the compete bipartite graph with vertex 
set A^uB^ (decomposed as A, and B.) for each i e [l:k],we 
observe that C = (id,id,...,1d) 1s an exact cover of G f by 
complete bipartite graphs G, ,G 2 ,...,G. , where id 1s the Identic 
map. Hence D(G f ) s k. 

D Theorem 3.13 

If we define SLQC (n) to be the maximum value of SLC(f) 
for qny quadratic m.b.f. of n variables^ then t Theorem 3.13 
implies that SLQC A (n) = W(n). Since ^ny H s1nfl§-le?el M*fonaula 
is in fact an M-formula, we know QL (n) ^ SLQC (n). We note 

A A 

that the proof given for Lemma 3^5 actually yields a single-level 
M-formula for ^ aay quadratic m»b . f., s©: StQ€ (tt)WiT-l. Since 

A 

QL (n| = e(n), we know also that SiQC fn) *• e^(nj. For the latter 
function, it was pointed out to the author by V. Chvatal + that one 
can actually get asymptotically matching upper and lower bounds on 
tf(n). - > 



3.14 Theorem : SLQC (n) « H(n), and ff(h) ~ n. 

A 



We Include a sketch of the proof of Theorem 3.14 in Appendix 1. 



+ Pr1vate communication, 1977. Recent work by Bermond [1978] has 
tightened the bound given on H(n) . 
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Section D. Open Questions 

At the end of each chapter, we list some open questions pertaining 
to that chapter. 

1. Demonstrate a m.b.f. for which ML A (f) + ML (f) < ML(f) - 1. 

2 " 2 

2. Is QL(n) = e(n /log n)? More generally, do o(n ) 

monotone formulas exists for all quadratic m.b.f. 's? QL(n) is 

2 
obviously 0(n ). 

3. What is the asymptotic behavior of QL Y (n) and QC v (n)? 
Determine QL and QC more exactly than given in Corollary 3.10. 

4. If f is a quadratic m.b.f., is SLC A (f) = ML A (f)? Is 
ML A (f) = MC A (f)? 

5. More generally, is SLQC (n) = QL, (n)? 

A A 
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CHAPTER 4 
Combinations of Functions 

One fundamental problem of Boolean function complexity is deter- 
mining the relationship between the complexity of individual func- 
tions and the complexity of combinations of those functions. The 
relationship depends not only on the type of combination but also 
on the model and measure of complexity used. In this chapter, we 
explore several questions in this area. 

Given Boolean functions f and g, we consider primarily the 
combinations fxg and v(fxg). Results about the function A(fxg) 
may be obtained by duality from those for v(fxg). we do not consider 
the composition of functions; this area has many interesting open ques- 
tions. The measures which we consider are formula and circuit size 
over different bases. 

As Paul [1976 p. 383] points out, one might expect that 
C B (fxg) = C B (f) + C B (g) since the evaluation of f and g on 

disjoint sets of variables "have nothing to do with each other." This, 
however, is incorrect; Paul shows that for any e > 0, there are ar- 
bitrarily complex functions f in B such that 

n,n 

C B (fxf) <: (1 + £ )C R (f). In addition, he exhibits arbitrarily com- 
2 D 2 

plex g in B R such that C B (v(gxg)) <; (I + g )c & (g). 

For M-circuits computing monotone functions, the results are 
drastically different. M. Fischer proved that 



MC(f *g) = MC(f) + MC(g) for any m.&.f.'s feB . and g e B + 

[Paul 1976]. It is still unknown whether MC(v(fxg)) = MC(f)+MC(g)+l. 
In Appendix 2, we present an extension to Fischer's result, due to 
Galbiatl and Fischer; [1 97a] in which, the sets of variables on which f and 
g depend have one variably tn common, in addition, we show that 

Fischer's result holds when v-gatesafcd A-ga^es am counted separ- 
ately; namely that MC A {f*$) .» m^) + MC A (g)? and dual! y 
MC v (fxg) -MC v (f) +«C y {g). 

We consider the question of formula size additivity; the only 
combination considered Is v(fxg). Here, 1n contrast to Paul 's 
results on circuit size, we show that formula size 1s additive under 
v, namely 

for any basis a which contains v, and any non-constant Boolean 

functions f € B n and geB m . When counting individual types of gates 

In formulas over the monotone basis M, we show tfiat 

ML v (v(fxg) • ML y (f ) + ML v (g) + 1 for arbitrary monotone functions 

f In B n and g In B m ; the corresponding question 

^aM^S) * ^-a^ + l*-*^ remains open* even for simple classes of 



tin fact, Fischer shows that any wttrimaj M-clrcuit for fxg 1 S 
composed of disjoint minimal circuits for f and g. 
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f unctions such as quadratic functions. 

Finally, using fact (1), we demonstrate a gap between the circuit 
and formula complexity of some specific functions. In particular, we 

show that there is a sequence of functions whose fU-elrcuit complex- 

2 
ity is 0(n log n), but whose B 2 - formula complexity is 

2 
ft(n /log log n). A slightly larger gap between these measures for a 

particular function had been previously shown using different techniques 

by Paul [1977] + , who observed the implications of fact (1). 

The following chart summarizes the current state of what is known 

++ 
about these problems . 



Worst-case results for these measures show that most functions 

f e fi have L n (f) = n(2 n /log n) and C„ (f) = 0(2 n /n). Hence for 
n « 2 B 2 

most functions the ratio between these two measures is o(n/log n), 
but the measures themselves are exponential in n. 

+ Paul's result on the sub-additiv1ty of x in the measure C D used 

2 
multi-output functions. 
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Table 4.1. Summary of results. 

Key - ADD: combination of minimal circuits/formulas 

gives optimal circuit/formula for combination 
SUB: negation of ADD for some particular functions 
?: open 
-: does not apply 

Section A^ The formula. Size, of v(fxg ) 

In this section we show that one cannot economize when constructing 
a formula for v(fxg) i n an arbitrary binary basis. 



4.1 Theorem: Suppose n and m e]N , f e B n and geB are non- 



m 



constant Boolean functions, and a is an arbitrary Boolean basis. 
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Then 

L fl (v(fxg)) aLft (f) + L n (g). 

Proof: Suppose F(x 1 ,...,x n ,y 1 ,... t y ) is an n-formula for 

v(fxg)(x 1 .... 1 x n ,y 1 y m ) = f(x ] .... ,x n )vg(y 1 y m ). Since f 

and g are non-constant functions, there are constants ae{0,l} n 

and ^{0,1}"' such that f (a) = q{t) ■* 0. Thus F(x 1 ,... ,* n >t) , 
even when simplified by absorbing constants into gates, is an 

Q-formula for f(x)vg(&) = f(x), and hence has at least L (f) 

occurrences of variables from {x^-.-.x }. Similarly, since 

F(a,y 1 y m ) is a formula for g(y), F has at least L (g) 

occurrences of variables from {y,,...,y >'. Since the sets 

Cx 1 ....,x n } and {y-,,...,y m } are disjoint, F has at least 

L ft (f) + l_ n (g) total occurrences of variables. Since this is 

true for any formula F for v(fxg), the theorem is proved. 

D Theorem 4.1. 



4.2 Corollary : If n is any basis containing v, and f and g 
are as in Theorem 4.1, then L ft (v(fxg)) = L ft (f) + L ft (g). Particular 
examples are n = B« and n = M . 
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To obtain the next corollary, we state without proof the following 
technical lemma due to Paul [1976]. 

N otation : For g £ B n , and kslN , k > 0, define Vg k by the Inductive 
rules: 

Vg « g 

Vg k m v(g*(Vg kV? )). 

k 
(So Vg is the disjunction of k copies of g on disjoint sets of 

variables.) 



4.3 Fact (Paul) : There is a constant a such that, for all k 
and n e K, k .;> 1, and all geB^ the following holds: 

C B (Vg k ) s a.max(n-2 n ,nk log 2 k) . 
2 



We also need the following fact due to Krichevskii [1961]. 



4.4 Fact : There is an n Q clNsuch that for every n > n^,, there is a 
function f € B n such that L B (f) * 2 n /(2 log n). 



We now can establish the following gap between circuit and for- 
mula complexity. 
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4.5 Corollary (Paul, Bloniarz) : There are constants n Q and a 
such that. if in > n Q is a power of 2 and n = 2 m /m, then there is 

a function fcB„ such that 
n 

(1) L R (f) M2 2m )/(2m 2 .log m) 

2 

while 

(2) C R (f) s a»m.2 m 

D 2 

Hence L g (f) = n(n 2 /(log log n)) whereas Cg (f) = 0(n log 2 n). 

Proof: For this proof, we assume that the basis of operators 
for all formulas and circuits is B 2< Let n Q be as in Fact 4.4. 
There is thus a function g e B m such that L(f) £ 2 m /(2 log m). 
By induction on Theorem 4.1, it is. easy to show that L(Vg k ) = k-L(g), 
so if we define 

f = V(g 2 > 2 ), 

we know that L(f) ;> (2 m /m 2 ).(2 m /(2 log m)) ? 

On the other hand, by Fact 4.3, there is a constant a such 
that 



a a*m*Z 



C B (f) * a>max(m-2 m , m-2 m -(m-2 log m) 2 /rn' 
%m 



The final remark, follows since f has n *= (2 m /m 2 )«m variables and 

since m < 2 log n. □' Corollary 4.5 
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Section B^ Monotone Functions 

We now restrict ourselves to the class of single-output monotone 
Boolean functions and formulas over the basis M. For the combin- 
ation v(fxg), Corollary 4.2 proved that raono tone formula complex- 
ity ML is additive since v e M. As for counting individual gates, 
clearly ML A (v(fxg)) s ML A (f) + ML A (g) for any m.b.f. 's f and g. 
It remains an open question as to whether there are* any non-constant 
m.b.f. 's f and g for which the inequality is strict. The cor- 
responding question for ML y is answered by the following theorem. 

4.6 Theorem : Suppose n and meN , and f € B and g e B are non- 

n ■ n 

constant m.b.f. 's. Then 

ML v (v{fxg))- ML y (f) + ML v (g) +1 

Proof: The fact that ML v (v(fxg))s ML y (f) + ML y (g) + 1 is 

easily verified by combining an v-minimal M-formula for f with 

one for g by an v-gate. To show the inequality in the opposite 

direction, suppose that F(x,y*) /is a ^formula for f(x")vg(y) 

with the minimal number of v-gates. We will prove that if H is 

any A-gate of F, then H depends 1 " only on variables from the set 

X B {x, x } or only on variables from„Y ■ {y,,...,y }. 

i ii "■-,.. l m 



We remark that in a minimal formula, the notions of structural 
dependence and functional dependence coincide. 
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Supposing this for the moment, we show that the theorem follows 
from this claim. Let H = {H|H is a variable or A-gate of F such that 
Succ (H,F) contains no A-gates}. By assumption, no node in H depends on 
variables from both X and Y. We look at Succ + (ff,F). If 
Succ (H,F) were empty, then since F is a formula there would be a 
unique node H Q in H. But then Res(H Q ,F) = v(fxg). Since v(fxg) 
depends on variables from both X and Y, and by assumption H 
does not, this is a contradiction. Hence Succ + (H,F) is a non- 
empty collection of v-gates, one of which is the output gate U 
(see Figure 3.1(a)); moreover 

Res(U.F) = H v H Res(H,F) . (2) 

We will replace Succ + (H,F) with a tree of v-gates which also 
realizes the function (2) and obtain a (possibly different) formula 
for Res(U.F). Let H^ denote the set {HeH|H depends on variables 
in X}, and let W Y = {Hetf|H depends on variables in Y}. By assump- 
tion, the pairs of sets H x and tf y is a partition of H. Form 
the sub formulas 

F = V F 
X H^Wu H 

and 

F = V F 
r Y HeH^Y 

(where the disjunction may be associated arbitrarily), and finally 
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the formula F' * F x vF y . Since Res(F) = (,V Res(H,F)) v (^ Res(H ,F), 

F' defines the same function v(f*g) as F. Moreover, since F 
was v-minimal, Succ (H,F) contains exactly ftf|-l v-gates; 







(a) (b) 

F1g. 4.1 The re-arrangement 

since |W|-1 v-gates were added in the reconstruction, F' also 
contains a minimal number of v-gates. But f(x lt ...,x )vg(y.,...,y ) 
■ Res(F x )(x,y)vRes{F Y )(x,y). Since Res(F x ) depends only on X and 
Res(F Y ) depends only on Y, it is easy to show that f « Res(F x ) and 
g - Res(Fy). Since the subtrees F x and Fy are disjoint, there 
must be at least ML v (f) v-gates in F x and ML y (g) v-gates in F y , 
giving a total of ML y (f) + ML y (g) + 1 v-gates in F', and hence 1n.F. 
To show that all A-gates in F depend on only one of X or Y, 
we suppose to the contrary that some A-gate depends on both X and Y 
variables. Let H be a top-most A-gate with this property, i.e. H 
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1s an A-gate such that H depends on some variable in X and some 
variable in Y, but any A-gate H' in Pred + (H,F) depends only on 
X-variables or only on Y-variables. We will use techniques similar 
to the first portion of the proof to restructure the formula and 
reduce the number of v-gates in F. 

Suppose that Pred(H.F) = {I, J}. Define 



A(I) = {GePred (I,F)|G is an A-gate or an input and all interme- 
diate gates (if any) on the path from G to H are v-gates} 



and define A(J) similarly. Note that neither A(I) nor A(J) ■. 
is empty by our selection of H. Since the portions of the tree 
from A(I) to I and from A(J) to ,3 consist solely of v-gates, 
there are |A(I)|-1 v-gates in the subtree from A(I) to I and 
|A(J)|-1 v-gates in the subtree from A(J) to 0. Functionally, 
we have 

Res(H,F) - [ fi ^ ( i)(Res(G.F))] A.[ G ^ (J) (Res(G,F))] (3). 

We are now in a position to use Mehlhorn and Gain's lemma 
(Lemma 2.8 ) to reduce the number of v-gates in the formula. 
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Define 



and 



A x = {GeA(I)|G depends on some variable in X} 

Ay = {GeA(I) j G depends on some variable in Y} 

B x = {GeA(J)|G depends on some variable in X} 

By = {GeA(J)|G depends on some variable in Y} 



Observe that since H is a top-most A-gate depending on both 
X and Y, A^nAy = B^By = 0. We construct a new formula F' 
by replacing the sub formula F R in F by a subformuia F Q 
which realizes the function 



f O = "A Res(G)) a ( s v R.s(G))] »[(* ResfGJ) a { * Res(6))]. 

a X Y Y 



If none of A x? Ay, B.., nor By is empty, then we define 



f o= [( gYa/g) a (gV g)K ^V g)a(g " b y Fg)] (4> 



In this case F Q has at least one fewer v-gate since the 

|A(I)| + |A(J)| -.2 v-gates we identified in deriving equation (3) 

have been replaced by 

(|A X H) + (|B X |-1) + (|A Y |-1) + (|B Y |-1) + 1 = |A(I)| + |A(J)| - 3 
v-gates. 



If, for example, a alone is empty, then we define F n to be the 
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obvious simplification of (4), namely 

V<gV , 'a ( 6V ) - (5) 

In this case, F Q contains (|A y |-l) + dB y |-l) = |A(I)| + |B y |-2 
v-gates. But B x ? since H depends on both X and Y vari- 
ables by definition.. Thus |B y | < |A(J)| so F Q has fewer 
v-gates then F„. 

If one of the other sets is empty we proceed similarly. If 
two of the sets are empty, then they must differ in their variable sets 
since H depends on both X and Y variables. So suppose w.l.o.g. that 
A x and B y are empty. Then f Q = 0, the constant function zero, so we 
may let F Q be the constant 0. In this case the closest v-gate in 
Succ (#,F) may be eliminated since one input to the gate is the constant 
0;-such a gate must exist since the formula F does not compute a constant 
function. 

Hence in all cases F has at least one fewer v-gate than F. 
Now if we can show that F" also computes f(x) v g(y), then 

we will have a contradiction. But this is in fact the case. Observe 
that f Q implies Res(H,F); in fact, PI(f ) consists of those 
monomes of PI(H,F) which consist solely of X-variables or solely 
of Y-variables. Since PI(v(fxg)) has no monomes which contain 
both variables from X and variables from Y, by Lemma 2.8 we know 
that each monome in PI(H,F) which contains variables from both X 
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and Y may be eliminated from PI(H,F) without changing the function 
defined by the formula F. Since the replacement of F Q for* F H 
eliminates exactly all such monomes, F and F' define the same 
function v(fxg). But since F* has at least one fewer v-gate 
than F, this is a contradiction to the v-minimaltty of F. 

D Theorem 4.6. 



Section C. Open Questions 

(1) There are several open questions listed in Table 1. Are 
any solvable in general? If the class of functions is restricted 
(e.g. to quadratic functions) are any of the questions solvable? 

(2) More generally, for arbitrary single-output m.b.f.'s f 
and g, is 

MC(f(g(x),g(y),...,g(?))) ■* MC(f) + MC(g) ? 
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CHAPTER 5 

The Monotonic Circuit Complexity of Threshold Functions 

One important class of Boolean functions is the set of symmetric 
functions, those which are invariant under any permutation of their 
inputs. Formally, a Boolean function feB is symmetric iff 

f ( x l ,x 2 x n) = f ^ x ir(l)' X Tr{2} , "** x 1 r(n)^ for every permutation 

* of the set [l:n]. One interesting fact about the symmetric 
Boolean functions is that they are fairly easy to compute by a 
circuit over a complete basis. The value ^ w s-ynwetric function 
on an argument depends solely on the number of inputs wteich are 
set to 1. Since it is possible to construct a linear-sized circuit 
which counts the number of l's in an input and outputs that number 
written in binary,. it is possible to use the output of such a 
"unary to binary" converter to compute the value of the symmetric 
function on the input. The best known general schemes for 
computing symmetric functions in this way over the complete basis 
B 2 require ~6n gates in all [Mull er and Preparata 1975] + . 

As for lower bounds on the complexity of symmetric functions, 
Lemma 2.11 shows that any non-constant symmetric Boolean function 
has complexity at least n-1 over B 2 since the function depends 

We remark that using a similar idea, one also obtains a fairly small 
formula (size 0(n 3,57 "-)) for expressing any threshold function 
[Peterson 1978]. 
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on each of its inputs. Schnorr shows that all but 8 of the symmetric 
functions of n variables have B 2 circuit complexity of at. least 
2ri-3 [1974]. Recently, Stockmeyer showed that at least half of the 
2 symmetric Boolean functions of n variables have B 2 circuit 
complexity of at least 5n/2 - S [197?], The latter result, along 
with an argument of Paul which establishes a similar 5n/2 - 0(1) 
lower Jjound on the complexity of specific functions [1977], are the 
largest known lower bounds on the B 2 circuit complexity of any 
single-output Boolean function otter than bounds obtained by 
diagonal arguments [Stoekmayer 1974]. + 

In this chapter, we consider the collection of monotone 
symmetric Boolean functions, the threshold functions. The main 
object of study will be the monotone circuit complexity of these 
functions. Recall the definition of the threshold functions. 

Definition : Suppose k,n € lN. Define the Boolean function threshold 
k of n. variables , denoted TJJeB n , by TJJ(x 1 ,...,x n ) = 1 Iff at 
least k of the inputs x,»...,x are 1. 

In particular, TJ] is the constant function 1 for all values of 
n. and T ffl 1s the constant function for all, m> n. Since 

T^Vx. and Tj=Ax t , 



+ Re «ntly, Schnorr [1978b] has announced a 3n - lower bound 
on the B 2 -circuit complexity of a function. 
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these extremal threshold functions have circuit complexity over M 
of n-1 . By observing that 

T k (x r ...,x n ) =--(TjJ_ k+1 (-.x 1 r-ix ; ,,...,-,x n )) 

we know that TJJ and T"_ k+1 are duals of each other, so that in 
general C^tJ) - C^T^^ ) and tfc(Tj) = MC(T^_ k+1 ) for 

ke[0:n+l]. 

For circuits over B 2 , the best known results are the following. 
The general uppar bound of 6n for the B 2 -circuit complexity of 
any symmetric function obviously applies to the threshold functions. 
(This general bound can be improved upon for small values of k.) 
Stockmeyer's techniques show that C B (TJJ) s 2n + min(k, n-k+1) - 3 

for any k«[2:n-l]. 

In this chapter, we present new lower bounds for the monotone 
circuit complexity of the threshold functions. For arbitrary threshold 
functions, we develop a general theorem using an approach similar to 
Stockmeyer, Paul, and others which shows that 

MC(TJJ) £ 2n + 2 min(k, n-k+1) - 0(1) 



for any ke[2:n-l]. In particular, for the "majority function" 
1 
W2~l 



T C7/9~l » we show that 



-96- 



m ( J \/2~$ * 3n " 7, 



Using different techniques, we also study lower bounds on the 
monotone complexities for the function t£. We present proofs of 
two results due to F. F. Yao , namely that 



MC V (T!!) = 2n-4 and : MC a (tJ) « fog 3 . 



We then show that these lower bounds are not achievable simultan- 
eously by proving that, for n a power of 2, any monotone circuit 
for ly witn exactly Tog r? v-gates must have at least 
2n + 4 log n - 9 v-gates. 

Before proceeding to proofs of the lower bounds, we first make 
some remarks on upper bounds for the M-circuit complexity of the 
threshold functions. As of the present, no linear (in k and n) 
upper bound exists on MC(T?) for arbitrary threshold functions. 
Hence the gap between the lower bounds mentioned above and the best 
known circuits are disappointingly large. For fixed values of k, 

L. Adleman has developed a scheme for constructing an M-circuit for 

tt 



T. which has kn + o(n) gates. 



t 

Private communication, 1976. 

tt 

Private communication, 1976. 
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In particular, he shows that 

MC(T^) = 2n + 0</n) 

and thus that the high order term of Yao's lower bound is exact 1 " . 
While Adleman's method yields the best known circuits for small 
fixed values of n, a result due to A. Yao and F.F. Yao on 
building selection networks from comparators [Yao 1974J can be used 
to show that, for fixed k, 

MC(TJJ) s Tog k+Tln + o(n). 

Finally, if k is allowed to vary. with n (for example if k= Wi 1 ), 
the smallest known M-circuit which computes tP has 0(n log 2 h) 
gates. This bound is obtained by using an M-circuft of this size 
to sort the inputs in increasing order (Batcher 1n [Knuth 1973 pp. 
111-114]) and then observing that TJj(x 1 ,...,x ) = 1 iff the k th 
largest input is 1. 



We remark that in addition, Adleman (private communication, 1976) 
has proved a lower bound of 2n + n(/n) gates for any M-circuit 
which computes t£, has exactly log n A-gates, and is synchronous 
(for definition, see [Savage 1976],p.l24). 
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Section A. The Class of all Threshold Functions 

In this section we show that the general lower bound techniques 
used by numerous researchers [Stockmeyer 1977, Paul 19755, Schnorr 1974, 
Khasin 1970] for circuit complexity can yield a larger bound when 
the basis of allowable gates is restricted to M. The basic method 
of attack is as f 61 Tows: To prove a lower bound on the combinational 
complexity of a particular Boolean function f, one attempts to 
show that there is a set of variables {x. | 1eA c [l:n]} and 
corresponding constants (c i | ieA, 0^(0,1}) such that the restricted 
function f | x _ _ c ^ Qr ^ ^ is a function whose complexity is 

known to be bounded below by some quantity. If, in addition, one 
can show that in any minimal B-circuit or formula which computes 
f, the setting x i = c 1 for t«A allows one to eliminate (as in 
Chapter 2) k gates because tnese gates now have at least one 
constant input, we then know that 



C„(f) * k ♦ C B (f j . fori£A ); 



hence a lower bound on C g (f) can be obtained. This is the gener- 
al approach we take; In particular, a lower bound on the restlcted 

function f | will be known by induction. 
x 1 c 1 
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Before describing the inductive hypotheses needed to prove our 
result, we first informally describe the method. The largest 
lower bound obtained from this method is for the majority function 
T n ry2l • We will reduce an M-circuit N for an arbitrary threshold 
function tP to one for t£Z by setting the input x h to a 
constant. The setting x_ = results in a circuit for T?" 
while the setting x « 1 results in a circuit for T.~, . If 
the constant can be chosen arbitrarily and still allow 'a large 
number of gates to be eliminated, then the constant will be 
chosen so that k' is as close to !fn*-l)/2* as possible. 

Some notation will be useful. 

Definition : The parity function , p:jN-*- {0,1}, is defined by 

is odd 



( 1 if n i 
P(n) = j 

* if n i 



s even 



The partial function £:JN*N ,->-in is defined by 
*(n,k) = |n+l-2k| whenever k s n. 

The sign functi on, denoted Sign:Z-> {0,1} is defined by 
Sign (x) 



( 1 if x s 
' if x < . 
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The function S is defined by S(n,k) = Sign(n+l-2k) 
for n.kfIN . 

Note that A(n,k) is a measure of the proximity of k to 
(n+l)/2. Observe that p(n~*(n,k)).= 1, and that for fixed n, 
the solution in IN to the equation A(k,nJ) = r is non-existent 
unless r * n-1 and p(n-r) = 1, In which case k *» (n*l-r>/2 
and k = (n+l+r)/2 are the only solutions. 

Definition : Suppose that k,A*J , ** s n-^3, and p(n-A) = Ti 
Define T p % = {threshold functions TJj: | Mn^k) = £}. 



For completeness, we define I- .*J if ,1 > n-3 



or p(n-fc) = 0. 



Define MC(n,fc) = max{MC(f) | feT }'. 

n »* 



We observe that by the above note, 1f Jt ^ n-3 and n-* is 

odd, then T^ = trJ n+1 . £ ) /2 » T (n+l+*)/2 } » and the two members of 

T are duals of each others. 
n,£ 

The prinicpal result we prove is the following: 

5.1 Theorem : Suppose n.teIN , p(n-£.) * 1, and i s n-3 (so n ;>3). 
Then HC(n,i) * 3n-max(*,l) - 6 . 
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Before proving Theorem 5.1, we observe the following corollaries: 
5.2 Corollary ; Suppose n * 3. Then MC(T n ^yg") -i.3n - 7. 

^£ T lh"/2l eT n,p(n+l) . Q Corollary 5.2 



5.3 Corollary : Suppose n z 3 , ke[2:n-T], k f (n+l)/2. Then 
MC(TJJ) s 2n + 2(mtn(k,n-k+l)) - 7. 



Proof : Assume ke[2:n-l] and k f (n+l)/2. Since 
i(n,k) <; n-3 and p(n-i(n,k)) ■ 1; we have TJj-'e"^ k x . But then, 
since k + (n+l)/2, £(n,k) = |n+l-2k| - max(n+l-2k J 2k-n-l) t 
and thus Theorem 5.1 implies 

MC(TJJ) a 3n - *(n,k) - 6 

.- 3n - max(n+l-2k, 2k-n-l) - 6 
= 2n + 2min(k,n-k+l) - 7. 

Corollary 5.3. 



The property of classes T which we will exploit is the 
following: 



5.4 Lemma : Suppose n.JieN, n > 3, and t < n-3. If feT ., then 

n »Ar 

there is a constant se{0,l} such that 
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M r \-i 'Vi.i+i 



If I <j n-5 



and 



< b > f lx n =-,s e T n-T,|,-T| 



Proof: Suppose f « x[J . e T^. We claim that -js = 5(n,4c) 
satisfies the properties claimed by the lemma. This is proven by 
examining both of the possible members of T For example, if 

f a T n T 

Vl-i)/2 € T n,A' 
then 5(i»,a} * 1, so 

f 'x n = 1 = T (n.l-i)/2 €T n-l,i+l \. 
The other case is similar. □ Lemma 5.4. 



We will use .one additional lemma which gives information about 
the structure of B 2 -circuits ( or M-circuits) computing threshold 
functions. 



5.5 Lemma [Schnorr 1974]: Let n be the basis B^ or M, and let 
N be a minimal n-circult for tJ f or n * 3 and k«[2:n-l]. 
Then there exists a gate G e Gates(N) and indices 1,J«[l:n] such 
that 
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(1) Pred(G) = {x.»Xj> 

(2) i * j 

and (3) outdeg (xj i2 + . 

Proof Sketch (For complete proof see [Schnorr 1974] or [Stockmeyer 
1977]): Since N is an acyclic graph, one can clearly find a 
node G and indices i and j satisfying property (1). Since 
N is minimal, i f j. We now prove that one of x^ and x. must 
have outdegree of at least 2. Suppose to the contrary that G is 
the unique gate in Succ(x i .) u Succ(x i ), and suppose Res(G,N) ■ 
g(x.,x.). Then two of the values g(0,0), g(0,l), and g(l,l) 
must be equal since they are all members of {0,1} . 



+ Actually, Lemma 5.5 holds for any function feB n with the property 
that, for any pair of distinct inputs x.. and x., 



{f l Xl = Cl I Cj.CjjcCO.I)} 



s 3. 



x j = C 2 



Such a function is said to satisfy the "2-3 property". Lower bounds 
on the complexity of functions with this and more generalized 
properties have been previously studied [Savage 1976]. 
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Consequently two of the functions TJJ | _ * , TJJ | x = Q , 

and TJJ | x _ ^ must be equal. Since these tJw*ee functions^ are 

distinct, this is a contradiction, so one of x. or x. must have 
outdegree at least 2. b Lemma 5.5. 



We can now return to the proof of Theorem 5.1. 

Proof of Theorem 5.1: The proof Is broken down Into two 

stages: 

(i) the result is first proven for I = n-3 for all n 

and 

(ii) the result is proven for is n-5 by induction on n, using 
as basis part (i). 

Part (i): Suppose I = n-3. In this case, T = {t!J,t" ,} . 

Lower bounds of 2n-3 on C D , and hence MC, for both of these 

B 2 
functions have been proven previously by Schnorr [1974]. This 

proof is repeated in [Stockmeyer 1977]. In fact, the lower bound 

of 2n-3 on MC(t!J) follows from Theorems 5,6 and 5.9, which 

will be proven later in this chapter independently of these results. 

The lower bound on MC(T" ,) follows by duality. 
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Part (ii): Suppose that i <; n-5, ftT . , and N is a minimal 
M-circuit which computes f. We break the proof up into several 
cases. 



Case 1 : There is an input x. with outdeg (x.) .a 3. See Figure 
5.1. 




i 



Fig. 5.1 Case 1 

W.l.o.g., by renumbering the input nodes of N we assume that 
i = n. In this case, by Lemma 5.4, there is a constant se{0,l} 
such that f -f I.. _ . .e T_ , i. ,, .. By setting x„ to 



' x n = - s 



■n-l,U-l 



-is in N, all gates in Succ(x n ,N) may be eliminated, so the 
resulting contracted network N' has at least 3 fewer gates than 



N. Hence MC(f) = C M (N) > C M (N') + 3 



s MC(f') + 3. 
If Ail, then fVT j £ _ lf so by induction 
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MC(f') + 3 > 3(n-l) - max (£-1,1) -6+3 
= 3n - max(£-l ,1 ) - 6 
> 3n - max(£,l ) - 6. 
On the other hand, 

if 4=0, then f'eT^ y so 

MC(f') + 3 > 3(n-l) - max(l,l) -6+3 
= 3n-7 = 3n - maxU.l) - 6 

This proves the result if Case 1 holds. Assume in the 
remainder that Case 1 does not apply. Assume in addition that 
x .j> x j > arid G are as in Lemma 5.5; w.l.o.g., we assume that j = n. 
Since Case 1 does not apply, outdeg (x. ) = 2, so we may let 



rV 



Succ(x n ) = {G,H>. 



Case_2: G is an A-gate, and H is an v-gate. See Figure 5.2. 

In this case neither G nor H is the output gate since 

Res(H,N) | = , is constant while f | _ is not; similarly 
n x n - ' 

Res(G.N) | x _ Q f f | _ Hence we can assume that A and B 
n n 
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are arbitrary members of Succ(G) and Succ(H) respectively. 
(Note that we are not assuming that A f B.) See Figure 5.3. 




Fig. 5.2 Case 2 




Fig. 5.3 Two Gates 
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We claim that A f H; that is, H^Succ(G). If it were 

(see Figure 5.4) then Res(H,N) = x v(x ax.) = x and the circuit 

v/ n v nv n 




Fig. 5.4 An Impossible Situation 
N is not minimal. Since Pred(G) = {x.,x } , G^Succ(H) and thus 
the gate(s) A and B are distinct from G and H. (In fact A f B 
but we do not need this). 

By Lemma 5.4, there is some constant s such that the restriction 



f 



e ^n_i I o_i I • fy setting x to -is in N, one 



of G or H now computes the constant function ""s, and hence at 
least one input to either A or B is now constant. Hence the 
resulting network N' , which computes f, can have at least 3 gates 
(G, H, and one of A or B) eliminated from it. As in Case 1, this 
implies that MC(f) > 3n - max(£,l) - 6. 

Case 3 : G is an v-gate, and H is an A-gate. 



This case is handled similarly to Case 2. 
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Case 4 : Both G and H are a -gates. See Figure 5.5 . 




Fig. 5.5 Case 4 



In this case, the objective will be to show that by setting 
:o the constant 0, at leas 
Observe that by Lemma 5 A , f 



x n to the constant 0, at least 4 gates can be eliminated from N. 



'x n = € T n-1,A+1 u " T n-l,|i-1|- 

As in Case 2, it is clear that neither G nor H is the 

output gate since f j x _. Q is not a constant function. Hence 

n 

outdeg (G) > 1 and outdeg (H) > 1. Also H^Succ(G) since if 

this were so then Res(H.N) = x n A ( x n AX i^ = V x i ^ see Fi 9 ure 5 * 6 ^- 
But this means that two gates, G and H, compute the same function 
x ax.j , a situation that never happens in a minimal circuit. In a 
similar fashion, it is possible to show that G^Succ(H). 

We now show that there are at least 2 gates in Succ(G) u Succ(H) 
Suppose to the contrary that there is only 1 gate A in 
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Fig. 5.6 Another Impossible Situation 



Succ(G) u Succ(H). See Figure 5.7. Let Pred(H) ={x ,J } , and 




Fig. 5.7 



•Ill- 



suppose that A is an op-gate, where ope{A,y}. By using the 
Identity (for any m.b.f. f) 



(V x i ) °P < x n Af ) = x n A (^ °P f) 



we may obtain a smaller circuit by replacing the subcircuit in 
Figure 5.7 by the following one with one fewer gate (Figure 5.8); 




Fig. 5.8 The Rearrangement 

that is, eliminate gates G, H, and A; add gates B and C as 
above and replace A by C in Pred(K) for every gate K in 
Succ(A.N). Thus, if N were minimal, there are at least two gates 
in Succ(G) u Succ(H) (see Figure 5.9). 

We now are in a position to show that at least 4 gates may be 
eliminated by setting x n = 0. Let N' be the circuit obtained 
by this evaluation, and note that Res(G,fT) = Res(H,IT) = 0. 
Hence all gates in Succ(G) u Succ(H) have at least one input 
constant. Since this is at least 2 gates in addition to G and 
H, this means that at least 4 gates in N" have a constant input 
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and may be eliminated, yielding a reduced circirit IT". 




Fig. 5.9 Two More Gates 



Finally, we show that this elimination of 4 gates yields the 
desired result. As mentioned r = f 1^ m Q « T^ >t+ , u T n _ lj|jMj . 

Since N" computes . f* , we obtain 



MC(f) = C M (N) s 4 + C M (N*') 
* 4 + MC(f ) 
i4 + min(HC(n-l,*+l), MC(fi-l;|A-l J) 

(since f^^uT^u..) 
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* 4 + min(3(n-l) - max(*-H ,l)-6, 3(n-T) - max(|A-l | ,l)-6) 

(by Induction) 

- 4 + 3(n-l) - maxU+l,|i-l|,T) - 6 
« 3n-£-6 * 3n - maxfcj) -6 
and the proof 1s complete for Case 4. 

Case 5 : Both G and H are v-gates. 

This case is handled dually to Case 4. 

D Theorem 5.1 



Section BL_ The Monotone Circuit Complexity of Threshol d 2_ 

We now consider a specific set of threshold functions, Tg 

for n * 2, and demonstrate some known lower bounds on the monotone 
circuit complexity for this set of functions. We first obtain 
lower bounds on the number of v-gates and A-gates separately. 
The general technique used in previous sections of setting 
certain inputs to constants and eliminating gates will be used here 
as well. For the specific function T~ we will be able not only to 
establish that a certain number of gates can be eliminated, but also 
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to determine what types of gates can be eliminated. 



5.6. Theorem : (F.F. Yao): If n^4 : then -MG v (fT3v * 2n - 4 « 

Proof' (Bloniarz) : By Induction on n. Ttte else' n = 2 is 

obvious. 

For larger values of n, we use an argument similar to that 
used previously. Assume the statement of the theorem true for n-1, 

and suppose N is an M-circuit computing T« (x, x n ) with 

exactly the minimal number MC^Ho) of ^gat^f. Assume fiirther 
that, N has the minimal total number of gates among all such 
circuits with MC v (t!J) v-gates. We show there is some variable, 
which by symmetry we assume is x , such that setting x to 
results in a circuit from which at. least two v-gates may be elin- 
ated. If the resulting contracted circuit is N'» then N* 
computes T 2 " (x^ x p , ) , and we obtain the Inequality 



rnx = 



Mc v (r 2 ') - MC y (N) * MG v (r) + 2 



s MC (t"," 1 ) +2 (since N' computes 



and the inductive assumption completes the proof . 

We use Lemma 5,5 to establish the result;, Wvl«o.g. we "^Y assume 
that there is a gate 6 in N and inputs x, and x n to N 
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such that i f n, Pred(G) = {x. ,xj and outdeg (x ) * 2. See 

in n 

Figure 5.10. Clearly by setting x to at least two gates may 
be eliminated. We prove that at least two of these gates are 
v-gates, but first introduce some useful notation. 




Fig. 5.10 



Suppose that G and H are nodes of N. Recall that a path from 
G to H is a sequence of nodes Gg,G 1 ,...,eL (for k i 0) of 



N such that G Q = G, G k = H and G^eSucc^.) for ie[0:k-l] . 
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Definition : An A-chain from G to H is a path G n ,...,G. 
such that G i is an A-gate for all ie[l:k]. That is, an 
A-chain is a path in the graph of N all of whose nodes, except 
possibly the first, are A-gates. Note that a path consisting of a 
single node is an A-chain regardless of the type of node. 

Define 

* 
T(G,N) = {HeSucc (G,N) | there is an A-chain from G to H} 

and 

V(G,N) = {H e Succ*(G,N) j H is an v-gate and 
Pred(H) n T(G) f 0} . 

(where again we omit mention of N if there is no ambiguity). 



The importance of these definitions is as follows. 

5.7 Lemma : Suppose N is an arbitrary M-circuit, and G and H 
are nodes in N. Suppose further that Res(G,N) = 0. Then 

(a) If HeT(G), Res(H,N) = 0; 

(b) If HeV(G), then H has at least one constant input 

and may be eliminated from N; 
and 

(c) If HeSucc (G) and Res(H.N) i 0, then every path from 

G to H contains a node in V(G). 
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Proof ; (a) and (b) are obvious. To prove (e), suppose 
Res(H.N) fQ. Then, by (a), there 1s no A-chain from 6 to H. 
Thus every path from G to H contains at least one v-gate, and 
the first such is easily seen to be a meKaber of V(G). 

D Lemma 5.7. 



We proceed with the proof of Theorem 5.6. Suppose U is the 

output gate of N, and let N be the circuit obtained when the 

constant ^ is substituted for x . T$y Lelnma 5.7(b), this setting 

x„ =0 allows us to eliminate all the v-gates in V(x ,N) from 
n n 



N'; if J V(x n ,W) | £ 2, then the proof is complete -. We therefore 
need consider only f V{x )| < 2. 

Because TJ] depends on x , we know that UeSucc (x ,N). 
Since Res(U.fT) is not the constant function 0, we know that 
there is at least one gate in V(x ) by Lemma 5^ 7(c). 



4. 

Note that T and V are defined in terms of the structural 
properties of the circuit; in fact V(x ,N) = V(ZERO.N'), and 
T(x n ,N) = T(ZERO.N'). We refer to these sets of gates as V(x n ) 
and T(x ) respectively. 
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So we may assume that V(x n ) contains exactly one gate H. 
Note that UeSucc (H,N) since N was an v-wfninal circuit for 
Tjj. Suppose that . Res OUT) .-•-.£« , Again using tenma 5.7(e), we 
know that V(H,N') -f f, Since the graph of N ♦$ acyclic, it is 
impossible for H to be a member of V(H), so |{H}uV(H)| a 2. By 
Lemma 5.7(b), all nodes in {H}uV(H) may be eliminated from N', 
constituting at least two v-gates. 

We may conclude the proof by demonstrating that the remaining 
case, namely when V(x n ) - «0 and Res (H^) ^£, cannot happen. 
Since H is a gate, we may assume that Pred(H> ^ HAY (where, 
it is possible that G is one of i,J, or B). Since HerV(x ), 
at least one of I or J must be in , T(x n ); we assimje w.l.o.g. that 
IcT(x n ). The steps of the proof that this case cannot occur are 
embodied in the following lemma. 

5.8 Lemma: Suppose G, H, I, and J are , as above* Then 

(a) Every path from x n to U in N must pass through H; 

(b) J does not depend structurally on x in N; 

(c) G is an A-gate; 
and 

(d) IeT( Xi ). 

Proof of Lemma : Result (a) follows from the fact that 
Res(U,N') f 0. By Lemma 5.7(c), we then know that every path from 
ZERO to U in N' must pass through H; that is, every path from 
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x n to U in N must pass through H. 

Since IeT(x n ), we know Res(I.N') =0 by Lemma 5.7(a). 

Hence Res(J.N') f since Res<I,N-) v Res(J,»-) = Res(H.N') f 0. 

Thus by Lemma 5.7(c), any path from x fi to J Would contain H, 

the unique gate in V(x n ), Since JePred(H) by definition, and 

N is acyclic, we conclude that there is no path from x to J. 

n 

Hence (b) is established. 

If G were an v-gate, then G would be equal to H and 
there would be exactly one path consisting of a single arc from 
x n to H. Since outdeg (x n ) > 2, there is a gate G' ^ G in 
Succ(x n ). Since there is a path from G' to U in IT, there 
must be a path from G' to H in N. Since both members of 
Pred(G.N) are input nodes, this is impossible, so (c) is established. 

Finally, since G is an A-gate, V(G) c V(x p ). By Lemma 5.7(c), 
we know that V(G) f since there is a chain from G to U in 
N' and since Res(G,N') = 0. Hence H e V(G), so by definition at 
least one of I or J is a member of T(G). If JeT(G), then 
JeT(x n ) in contradiction to Lemma 5.8(b); hence IeT(G). Since 
T(G) c T(x 1 .)» the proof of (d) is complete. 

D Lemma 5.8. 



Now let N-- be the circuit obtained from N (not fT) by 

setting x 1 to 0. Res(U,N"), being the threshold 2 function of 

the remaining n-1 variables (x, x } - {x.}, depends on x . 

i n i n 

By Lemma 2.9, we know that there is a path C in N" from x 
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to U such that every gate in C depends fictionally on x . 

n 

Since H is on every path from x n to tf in N by Lemma 5.8(a), 
it must appear on every path in N" and hence on C. Thus 
Res(H,N") depends on x in H". 

However, by lejpas 5.7(a) and 5;8(d? ( ResOiN'^} ■ Gj, so 
Res(I,N") does not depend functionally en x .Similarly, by 
Lemma 5.8(b), J cannot depend functionally en x in N, and 

hence in N", so Res(J,N") does not depend functionally on x. 

n 

Thus Res(H,N") does not depend on x n v a contradiction. 

Theorem 5.6. 

We can also get a lower bound ore the lumber of A-gates in 
any M-circuit which computes t!J. Here the technicjue used is 
similar, but we must set more than one input to a constant to 
eliminate a single A~gate. 

5.9 Theorem: (F.F. YaO): Suppose nil. Then MC A (T^) s Togg n 1 

Proof: We prove this fact by strong induction on n. For 
n = 1 the result is trivial. 

Now suppose n * 2 and the statement of the theorem is true 
for all n' < n. Suppose N is an A-nninimal M^ircuit Which 
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computes t!J. Since n a 2 and since x^l^lj)- for any Index 
1e[l:n], N must contain at least one A-gate. Let 8 be a 
top-most A-gate in N; that 1s, 6 is an A-gate such that every 
gate in Pred (G) is not an A-gate. Clearly such a gate exists 
since N is acyclic. Suppose PredfG) * W, J}. See Figure 5.11. 




Fig. 5.11 A Top-most A-Gate 
Since G 1s a top-most A-gate, all gates in Pred + (G) are 
either v-gates or inputs. Hence Res(H.N) = ,V.x. and 

Res(J,N) » i YB x i for some n° n " em Pty subsets A and B of [l:n]. 
Observe that 

, ReS ^ N ^(i e lB X i)M 1 X B (V X j^ ™ 

jeB-A 

Let C denote the smaller of the sets A-B and B-A and let c 
denote |C|. Then one can easily check that c £ n/2. By renum- 
bering the inputs, we may assume w.l.o.g. that C « [n-c+l:n]. 

Let N' denote the circuit which results when the entire set 
of variables {x^ | ieC} is set to 0. Note that N * computes the 
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_ T n-c 



function T 2 | x = o for 1 e C = T 2 * By (1) we know that 

Res < 6 ' N '> = 1eAnB x 1 ' 

We now modify N' to eliminate gate Q, Construct * new 
tree, of v-gates and input nodes which coajputes the Junction 
1eAnB x i at a gate 6 '* Eliminate G from N', and replace G 
by G' in Pred(K) for alV gates KeSucq^GJI') . (If G were 
the output gate of N, then G' will now Me the output gate.) 
Call the resulting circuit N". Sfh©e\ G has been replaced in 
N" by a node which computes the samel function, N"" also computes 
T 2 " . Since N" has one fewer A-gate than N and since N 
had a minimal number of A-gates computing t2, we know that N" 
does hot Compute TJJ and thus c 2 1. Hence 



MC A (Tj) = MC A (N) »'l + MC A (N") 
* 1 + MC A (T^" C ) 
^ 1 + log(n-c) (by induction) 



and the theorem is proved. 



* 1 + log(n/2) (since c <. n/2) 
= log(n) 

D Theorem 5.9. 
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In preparation for future results in the chapter, we note the 
following additional facts gleaned frop the proof of Theorem 5.9. 
Suppose n is an exact power of 2, and N is an M-circuit for 
T 2 with exactly log n A-gates (which we later show is possible). 
If G is a top-most A-gate of N. then if c (as defined in the 
proof of Theorem 5.9) is less than n/2, the setting of the variables 
{x i | ieC} to leaves a circuit IT for a function t"\ where 
n'> n/2, which has 1 fewer A-gate than N. But this is impossible 
since then 

log n * MC(N) 

2 MC(fT) + 1 

s log n' +1 
= log (2n') 

> log n 
Hence we have proved 

5.10 Corollary : Suppose that n = 2 r for relN , and that N is 
an M-circuit for t!) with exactly log n A-gates. If G is a 
top-most A-gate of N and Pred(G) = {H,J}, then there is a 
partition of [l:n] into two sets A and B such that 
|A| = |B| = n/2 and such that Res(H) = ..V^. and Res(J) = . V gX.. 
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We proceed with our discission of T* by demonstrating that 
each of these lower bounds is attainable in some M-circuit comput- 
ing T^. 

An M-circuit wtth exactly 2h-4 v-gates for TJJ (for n t 2) 
is obtained by use of the following recursive dons trtictions. We 
will show inductively that, for n a 2, there is a two-output 
M-circuit on Inputs ■ (x, ,...,x }' containing 2n-3 v-gates 
which computes T^tx,,... ,x ) and T|(x, , . . . jX n ) . For n = 2, 
the circuit is given in Figure 5.12. 




Fig. 5.12 A circuit for n=2 



Assuming the fact true for n, we may construct such a circuit 
for n+1 variables with 2 additional v-gates (and an, A-gate) 
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by use of the recurrence relations 



T 2 (x T x 2 VW = T ^ x T"" x n ) v (x n+l A T r( x i >■• • »*„> 



and 



r n+l 



T I|T| , . —11/ » 

T l (x l Wl* = Vl v T i( x r---.x n ). 



See Figure 5.13. 




Fig. 5.13 The Recursive Constructi 



on 
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A circuit for 1"!} with exactly 2n-4 v-sa*e*t may be thus 
obtained since at the last step in this construction of Tp. there 
is no need for the v-gate marked by * in Figure 5.13. 

5.11 Corollary : For n a 2, MC (TJJ) = 2n-4. 

As for exhibiting an M-circuit for fj with exactly Tog fH 
A-gates, we first describe a general method for such a construction 
which was communicated to the author by F.F. Yao and A. Yao. For 
the remainder of this chapter it will be convenient to have the 
variable indices begin with rather 1. ._ 



Definition : Suppose £elN . Ht s.l a 2* and 1 * is £, define 
(i).. to be the j digit in the binary expansion of i.'that is, 

1 if Hmod 2 j ) * 2*" 1 
(i). 



><-{ 



otherwise. 



Suppose new . Define for each j«[l : Tog (n)"*j4toe following 
subsets of the variables {x. | ie[0:n-l]}. 

A^ - (x. | i € [o:n-l] and (f). =0} 
and Bj = {x i | i € [0:n-l] and (i). = 1} . 
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Define corresponding Boolean functions a^,.b?€fl by 



VV" ,,x n-l* * l£Aj X i 



and b j< x Vl 1 ' IcB^i ' 



(We omit mention of n if it is clear from context. ) Finally, the 
radix- join function F n e B n ? Toa r? * 9 defined by 



F n « (a 1 ,b 1 , a 2 , b 2 ,.. M *ff ogW & rf og „1 )• 
An M-circuit which computes F n is called a radix- join network. 



It is easy to see the relationship between the radix- join 

function and the function "Threshold 2." In fact, with the 

definition given above. 

Tog 7? 

t5 - V (a.Ab.). (2) 

6 j = 1 J J 



To prove (2), note that the sets of variables A. and B. are 

J j 

disjoint for each je[l : Tog r? ]. Hence, letting h be the 
function defined by the right hand side of (2), we know that h £ T 2< 
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Conversely, for any pair of variables x^ and x,, with k j i, 
there is some position j at which the binary representation of 
k and i differ. Hence Xj c x,ePl{a,.-xb.) 1 $o \x. £ h. This fact 
being true for every k f i in [0:n-l], we know that t!J s h. 
See Figure 5.14. 




fiognl 



Fig. 5.14 An M-circuit for T 2 



Applying the decomposition (2) to complexities, we have the 
following: 
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5.12 Lemma: Suppose n«N , and F n 1s the radix-4oin function. 
Then 

MC(Tj) s MC(F n ) + 2 l Tog i? - T 
and ^(Tj) s MC^f") ♦ fog n 1 



Since F n Is defined only 1n terms of v-gates, MC (F* 1 ) = 0; 

A 

hence we have: 



5.13 Corollary : For n s 1 , MC (tJ) = Tog n 1 . 



From the formula-definition of the radix- join function one can 
clearly construct a radix-join network with 0(n log n) v-gates. 
A straightforward recursive construction yields a radix-join network 
with 3n - 2 log n - 4 v-gates, yielding an M-circuit for t" 
using Lemma 5.12 with a total of 3h-5 gates. Several substan- 
tially different M-circuits for Tg with this size can be constructed, including 
the circuit in Corollary 5.11. L. Adleman proved that asymptotically 
fewer than 3n-5 gates are necessary 1 " , and using his techniques 



+ 

Personal communication, 1976 
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one can prove the following: 

5.14 Theorem : Suppose nelN . Then 

C {v} (F n ) s2» + 0(Vn), 



Proof : Suppose n >4 and let i = Tqg nl Let n, « . 2 I&/S " 

and n 2 = fi/n^ 1 . We will use the following recursive expansion 
to construct an efficient radix-join network for n variables. Let 
X denote the set of variables {x i | ie[0:n-l]}. 

Suppose that p^O^-l] and qe[0:n 2 -l]. Define subsets 
of the variables X for such p and q by L' = {x. | ie[0:n-l], 
i (mod n^ = p} and H^ = {x 1 j i € [^:n-l3, fi/n,j = q}. Informally 
if all variables are thought to have indices whose binary represen- 
tation is padded to length exactly 4, then L' is the set of 
variables which have a binary index whose lower order |*/2, bits 
are equal to the binary representation. of p. Sinilarly, H' is 
the set of variables whose binary index has a, high order 1/2 1 

bits equal to the binary representation of q. We have correspond- 
ing functions L n and H„ in B„ where L„ is the disjunction 
p q n P 

of all variables in L', and H is the disjunction of all vari- 
ables in H'. Note that each variable x^X is a member of exactly 
one of the sets {L' | peCO:^-!]}, namely the unique p such 
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that i (mod n^) » p. Similarly* it is easy to verify that the 
collection Hq,^ ,...,H n _ 1 is a partition of t> a set X and that 
x^H where q = • |1/b-jj. Moreover, none of the sets H q is empty 

V 



for qe[0:n ? -l] since q«n, < n,n,- n, < n and x„ „ eH . 
^ l I £ I q«n, q 



We claim that the relationship 

F" = (F n l(L ,L r ...,L n _.,)> F n2 (H ,...,H n -T )) (3) 

holds, where we use the obvious isomorphism between {0,1} 2 *and 

w.n^xw.iW 

Assuming this for the moment, we describe an efficient radix- 
join network using (3). Using the definitions of H and L , 
we construct an independent {v} - circuit (which happens to be 

fan-out free and is thus a formula) for each of the functions 

Lq.L-j !_ n _-j and the functions Hq.H.,... ,H , . To count 

the number of v-gates which this takes, suppose |L"| = a for 
peCO:^-!]. Then, since L is a disjunction of single variables, 
C{ V }(L ) = * p - 1 since ■ % f O.for each p. Hence, the total number of 
v-gates required to compute all the functions L ,L,,...,L n _-, is 



pfo:n,-l] V> '- ( pl[0:n r l] V • "l 



= n "- n. 
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slnce each x.eX Is a member of exactly one of the sets t* for some 
pe[0:rij-1]. Similarly, one can compute all the functions H_ 
for qc[0:n 2 -1] with a total of n » n^ v-gates* giving a total 
of 2n - n 1 - n 2 gates in all. 

We now use the n, functions L_ for pe[0:n.,-lj as inputs 
to a recursively constructed radix-join network F 1, and likewise 
the «2 functions H for qeEdriwrrT] as inputs to a recursively 
constructed radix-join network F 2 . Using equation (3), this yields 
a circuit for F n * (See Figure 5.15.) 
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Fig. 5.15 The Recursive Radix- join Network 
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If we let T(n) = C.,(F n ), then we have the recurrence rela- 
tionship 

T(n) s 2n - n 1 - n 2 + l(n^) + T(n 2 ). 

Since n^ < i^n and n 2 s /2rT and since t(4) =4. this recur- 
rence can be bounded by 

T(n) s 2n + t 4(n/2) 1/2 + 8(n/2) 1 ^ + 16(n/2) 1/8 + .. 

1 

(log log(n/2) terns ) 



log log(n/2) . , 1/? t 

2n + I 2 k+1 (n/zy /d 

k =1 



which establishes the theorem since the summation is 0(/n). 

It remains to prove equation (3). Suppose that a", b", a", 

j j J 

and bj are as in the definition of F n . Suppose that j £ |£/2| . 
Then Aj = {x. | (i) j = 0} = (p) cy.' since (1)j = iff (p). = 
where i (mod „,) - p . Hence J aJ - "^V^ ? ^ ^ 1s tte ^ 

responding component a.. 1 of F 1 (L , : .. rr ..l^ _ r ). Similarly bj 
is the component b. 1 of F ^^.....L -,). 

J U -....•■■' fl^* I ■-'". ■■ 

A similar proof demonstrates that when iA/2j < j s i t a" is 

J 
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the component z£ ^ of F 2 (H Q , . . . ,H n _ } ) and b n . 1s the 

D Theorem 5.14. 



n« 
component b/ „ /0l 



Applying Lemma 5.12, we have 

5.15 Corollary : (Adleman) Suppose ne N . Then 

MC(T^) s 2n + O(vfc). 

We will have more to say about this method 1n Section D. 
Section C - M-circuits for Threshold 2 which are A-minimal 



In the previous section, we demonstrated lower bounds of 2n-4 
and 1og 2 rP on the number of v-gates and A-gates respectively 
necessary to compute t" by an M-circuit, and demonstrated circuits 
which achieved each of these bounds. The question naturally arises 
of whether there is a single M-circuit which computes t" and 
which simultaneously achieves this minimum number of A-gates and 
v-gates, and in this section we show this is impossible for infinitely 
many values of n. Our technique will be to. establish larger bounds 
on the number of v,-gates necessary to compute tJ (for n a 
power of 2} for any M-circuit which has exactly log n A-gates. 
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We begin by examining the structure of such circuits with 
exactly log n A-gates. In Section B, we saw that such a circuit 
may be constructed from a radix- join network. In this section, we 
will show that it is possible to extract a radix-join network from 
any M-circuit for T? with exactly log n A-gates. For now, we 
assume that n = 2 is an exact power of 2 (for some r > 2), and 
that N is an M-circuit which computes Tp, has exactly r A-gates, 
and among all such circuits has a minimal number t of v-gates. 



5.16 Theorem : Suppose n,N, and t are as above. Then an 
M-circuit R exists which computes F n , the radix- join function, 
and which has t - log n + 1 v-gates and no A-gates. 



Proof : We show how to extract R from N. We first show 

n 

that there is a restructuring N Q of N which also computes T« 
in which there is only a single A-gate on any path from an input 
node to the output gate - that is, there are no A-gates in 
Pred (G,Nq) ySucc (G,N Q ) for any A-gate G. Call such a circuit 
a Single -level circuit. Moreover, we show that it is possible to 
construct N with no additional A-gates or v-gates. 

Suppose that N does not have this property, and let U 
denote the (unique) output gate of N. Let G be a top-most 
A-gate in N which has at least one A-gate below it; that is, 
Pred (G,N) contains no A-gates and Succ (G,N) contains at 
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at least one A-gate. Observe that G f U by definition since 
Succ (U,N) *J. We show how to re*wirt the circuit so that 6 is 
no longer on a path with any other A-gate. 

We first construct a circuit N- as follows. Add an additional 
v-gate U', and set Pred|U',r) » CS.U). Inaddition , for 
every gate H e Succ(S,N). replace G in Pred(H) by the constant 
node ZERO. See Figure 5,16, 





Fig. 5.16 The Reconstruction 



Then simplify this circuit to yield a third circuit N^' by 

simplifying any gates with at least one input a constant function. 
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Since outdeg (G,N) s 1, this removes from N' all gates in 
V(ZERO.N') (as defined in Section 5B) and hence at least 1 v-gate, 
so IT has at most t v-gates and at most r A-gates. 

We claim that Res(UMT') -TJ), and hence N*r has exactly 
t v-gates and r. A-gates. (Note that Res(U',N") . - Res(U'.N') ). 
To prove this, assume that c = ( C] ,c 2 ,... , C|) ) is an arbitrary 
constant in {0,1 } n . We show that 



Res(UMT)(c) = jj(c). 



Since N had log n A-gates, by Corollary 5.10 we know that 

there are subsets A and B of [0:n-l] such that AnB = and 

Res(G,N) = .^Xj . Hence Res(G.N) s tIJ. Thus, in the case 
jeB 

that t!J(c) = 0, we know Res(G,N)(c) = 0. In this case it is 

easy to prove by induction on length of paths that 

Res(K,N)(c) = Res(K,N')(c) for every KeNodes(N). Hence 

Res(G,N')(c) vRes(U,N-)(c) = 0, and thus Res(U',N')(e) *0. On the 

other hand, suppose Res(U',N')(c) = 0. Then Res(U,N')(c) = 

Res(G,r)(c) = so Res(G,N)(c) = since N and N* are 

identical on the predecessors of G. Thus, it is again easy to 

verify by induction that Res(K,N)(c) = Res(K,N')(c) for every node 

KeNodes(N), so Res(U,N)(c) = 0. Hence 
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(Res(M)(t) = 0) *>(Res(U',N')(t) * 0) 

so Tg = Res(U.K) .* Res(U'.N-). 

The effect of this rewiring is to remove G from any path 
which has more than 1 A-gate on it. Since the transformation 
decreases the number of A-gates with A-gate successors*, by recur- 
sively repeating this process for top-level A-gates with A-gates 
below them, the result is a single-level M-circuit for T^ with 
r A-gates and t v-gates. Call this modified circuit N Q . We 
remark 1n passing that every path from an input to the output in 
N Q must have exactly one A-gate on it since PlCrJ) contains no 
single variables. 

We can now extract the radix- join network from N Q . Let U 

denote the output gate of N Q , let G '■' * {G^Gg 6 r ) denote 

the set of (incomparable) A-gates in N Q , and let Pred(6.)= {LuM 
for all i e [l:r]. See Figure 5.17. There 1s a gate in 6 on every 
path from INPUTS(N) to U. Since N is minimal, there is a path 
from each node of 6 to U. Since there are only v-gates on any 
path from 6 to U, we know that Res(U) « G v G Res(G). Moreover, 
since N- contained a minimal number t of v-gates , we know that 



Observe that for arbitrary nodes A and B of N , if A 
is a member of Succ (B.N"), then A was a member of Succ (B,N'), 
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the portion of the circuit between the A-gates 6 and the output 
U consists of a tree of agates. Thus there are r-1 v-gates 
H = {H 1 H r -j} {one of which is U) below G in the circuit. 




Fig. 5.17 The Circuit N 



1 



Fix je[l:r] and let ie[0:n-l]. By Corollary 5.10, since N, 
has r = log n A-gates, we know that there is a partition of the 
input variables X = {x |k 8 [0:n-l]} into two sets A and B 
such that Res(I.) is the disjunction of all variables in A and 

w 
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Res(Jj) is the disjunction of all variables in B. Hence, 
since the circuit of the predecessors of G consists entirely of 
v-gates, there is a path from the particular variable x. to exactly 

one of I . and J .. 
J J 

Moreover, suppose that x i and x k are two distinct variables 
of X. Since x^ePlCrlJ), and since TJJ = G ^G , there must be 
some gate G. such that x.x. 6 PI(G.). Hence there is either a 

J 1 K J 

path from x. to I. and a path from x. to J. or vice versa. 

J K J 

With these observations, we may renumber the inputs as follows: 
If E is an input node to N Q , we label E by x. , where we 
define the index ie[0:n-l] for E bitwise by defining 



1 if there is a path from E to I. in l\L 

l V j ° 

v if there is a path from E to J in N n 

J u 

for each je[l:r]. By the above observations, it is easy to verify 

that, since n = 2 , each input node gets labelled with a variable 

from {x.j | ie[0:n-l]} . Moreover, each input node gets labelled 

with a distinct variable since two distinct input nodes must have 

paths to different predecessors of one of the A-gates G.. We 

J 

assume w.l.o.g. that this is the numbering of the inputs since t" 
is a symmetric function . 

Now let R n be the circuit obtained from N Q by deleting the 
sets of gates G (all the A-gates of N Q ) and H (r-1 v-gates) 
Since R n is an {y}-circuit, we know that, for any gate D in R , 



-141- 



[x.ePI(D)] ♦ [DeSucc (x..)]. Hence, if the radix- join function F n 

has components (a, ,b, ,a 2 ,t> 2 ... ,a r ,b ), then, for each je[l:r], 

a. = Res{J <a R ) and b. = Res(I.,R ). Hence R is a radix- join 
J J n J J n n 

network for n-variables. 

Since R contains t - r + 1 v-gates, the theorem is proved. 

B Theorem 5.1 6. 



Armed with this theorem, we proceed to obtain bounds on the 
number of v-gates needed to compute the radix-sort function in 
{v}-circuits. We first prove a general lemma about {v}-circuits. 

Defintion : Suppose N is an M-circuit. Define the center of N_, 
denoted Z N ,by Z N = {GeGates(N) | Res(G.N) = }. Note that 
Z N = unless outdeg(ZERO) > 1. 

5.17 Lemma : Suppose N is a {v}-circuit with inputs {x, ,x 2 ,.. .x } 
for which outdeg(ZERO.N) = , and some subset A of inputs to 
N is set to 0, yielding a circuit N'. Then it is possible to 
remove from N' at least 



I outdeg(I.N) + I (outdeg(G) - 1) 
I e A GeZ N > 

v-gates in Succ (A,N). 
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Proof: Recall that the operation "setting a variable x to 

0" in a circuit involves replacing x by ZERO in the predecessors 

of all gates in Succ(x). Hence outdeg(ZERO,N') = £ outdeg(I.N) 

IeA 
and it suffices to show that at least 

outdeg(ZER0,N' ) + £ (outdeg(G) - 1) (4) 



GeZ N- 



+ , 



v-gates in Succ (ZERO, IT) may be removed from fT . We prove 
this by induction on f Z M-l • 

If Z N . = 0, then e\/ery gate in S = Succ(ZERO.N') may be 
eliminated from FT since each gate in S has at least one input 
constant. If \S\ < outdeg(ZERO,N') , then by the pigeon-hole 
principle there must be some gate GeS with both nodes in Fred(G,JT) 
being the node ZERO. But then G^Z^, which is a contradiction. 
Hence, at least outdeg(ZERO,N') gates may be eliminated. 

Now assume that |Z N _| = k > 9 and that (4) is true for all 
{v}-circuits with centers of size k-1 . Let G„ be a member of 
I(Z N J, the set of initial gates of Z^, and suppose Pred(G) = {J,K}. 
See Figure 5.18. Since Res(G Q ,r) - and G Q is an v-gate, 
we must have Res(J.N') = Res(K.N') - by Lemma 2.1. Since J 
and K are not members of 1^ (by definition of Gj, it must be 
the case that J = K = ZERO. See Figure 5.19. 
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Fiq. 5.18 G n e I(Z N . ) 




V) G C 



Fig. 5.19 



We construct a new circuit N from N' by eliminating gate 
G Q , and replacing G Q by ZERO in Pred(H) for every gate 
HeSucc(G n ,N'). For every node LeNodes(N), we know that 



Res 



(L,N') = Res(L,N) since G Q was replaced by a node which 
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computes the same function. Furthermore, |Zd = |Z N -|- 1 

(since G Q was removed from the circuit); outdeg(G,N) = outdeg(G,N') 

for every gate GeZj^; and outdeg(ZERO,N) = 

outdeg(ZERO,fT) - 2 + outdeg(G Q ,NO . Thus, by induction, it is 

possible to remove from N at least an additional 



outdeg(ZERO,N) + I (outdeg(G,N) - 1) = 



GeZ N 

outdeg(ZERO,fT) +[ £ (outdeg(G,ir) - 1)]- 1 gates from 

G e Z r 

Succ (ZER0,N) £ Succ + (ZERO,N'), which, together with the elimination 
of G Q , proves the lemma. 

D Lemma 5.17. 



One additional lemma will prove useful: 

5.18 Lemma : Suppose n e w, n > 4, and n f 2 r +l for any r-.e K 
Let R n be a minimal {v}-circuit for the radix-join function F . 
If i e[0:n-l], then outdeg(x.,R ) > 2. 

Proof: Suppose n, R , and i are as in the statement of the 
lemma. Let F n = (a ] ,b 1 ,. . .a rr o ^ ,b pf og ^ ). Observe that 
iPlCa-j ) | = f n/2 n and . | PI ( t> 1 ) | = [n/J and hence neither PI^) 
nor PI(b^ ) consists of a single variable. Since x. is a member 
of one of Pl(a-j) or PI(b.j), we must have that outdeg(x. ,R ) > 1. 
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Suppose that outdegU^ ,R n ) = 1 , and let Succ(x.,R n ) = {G} 
where Pred(G,R n ) = {x^H}. See Figure 5.20. Since N is a {v}-circuit 




Fig. 5.20 The Case outdea^) = 1 



PI(H,R n ) is a non-empty collection of single variables, so let 



x^ePI(H,R n ) be chosen arbitrarily. Note that k f i since the 
graph of R is acyclic. 

It is easy to verify that in any {v}-circuit N, 



[KeSucc (L,N)] ^[PKL.N) c PI(K.N)] (5) 

for any K,L e Nodes(N). Thus, in this case, since 

Succ (x. ,R ) * Succ (G,R ), we must have x. ePI(K,R ) for any gate 
in n k n 

KeSucc (x. ,R ). However, if we let j be a radix position such 
that (i). = (k). (which must exist since i f k), then one of 

J J 
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PI(a.) or PI(b.) should contain x* but rwt x.i assume W.l.o.g, 
that XjePIfa ) and x k ^PI(a.). Since R computes F n , there 
must be a node D in R which computes a.. Since n f 2 r +l , 
no component of F n (including a.) is the projection of a single 

J 

variable, so D must be an v-gate. Since D depends on x. 

and hence is in Succ (Xf,Rj, we must hive jx.ePI(a ) which is a 

in -"t k j 

contradiction. , Q Lemma 5.18 



We are now ready to establish a lower bound on the complexity 
of any {v}-circuit which computes the radix-join function. 

5.19 Theorem : Suppose neM . Then Cr vJ (F n ) a 2n + 2Llognj - 8. 

Proof : The theorem is easily verified for n s 4. For larger 
values of n, we first solve the problem, in the case in which n is an 
exact power of 2 v say n « 2 r for some ::-.t*W 4 Me proceed by 
induction on r; the cases r = 1 and 2 have already been disposed 
of. 

Assume the statement of the theorem true for r - 1 , and that 
n = Z . 

Suppose that F n » (a 1 ,b 1 ,...,a r ,b r ) and. that R R is a 
minimal {y}-circuit which computes F . Our general objective will 
be to show that by setting half of the variables {x, ,x 2> ...tX ri } 
to 0, one can eliminate at least n + 2 v-gates from the resulting 
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circuit and have a circuit ft- which computes. ¥ . Assuming 
this for the moment, we then know that 



* n + 2 + C {v} (F n/ ?) 

2 n + 2 + 2(n/2) + 2(log (n/2)) - 8 

(by induction) 

= 2n + 2 log n - 8, 

thus proving the theorem. 

So suppose R is as above, and that L and J. are the 
nodes in R which compute a. and b. respectively, for 
ke[l:r]. 

By fact (5) of page 145, in R each output node in 

{I, ,J, ,... ,1 ,J } must have outdegree zero since the sets of 

prime implicants of each component of F n are incomparable. In 

particular, if D, and D 2 are arbitrary output nodes, then 

| PI(D,)nPI(D 2 )| ^ n /4. Each output node is an y-gate since each 

component of F has n/2 > 1 prime impltcants. These are the 

only nodes in R„ with outdegree zero since R is minimal, 
n n 

Let 



A= PI(I r ,R n ) = {x,'| (i) r ■ 0} 
and 
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B = PI(J r ,R n ) = {x i | (i) f = 1} 

Observe that AnB = and |A| = |B| = n/2. Let R. (resp. R & ) denote 

the circuit obtained by setting all variables in A (B) to 0. 

It is easy to see that R R , with output nodes H-i .J-. >. . . >I -i >J -i) > 

is a radix- join network on n/2 variables. Similarly R. with 

the same set {I, ,J, ,. . . ,1 , ,J , } of output nodes is a radix-join 

network of the remaining variables in B if the r bit of the 

index of each variable in B is changed from 1 to 0. 

We show that n+2 gates may be eliminated from one of these 

circuits R. or R„. In R., the only gate with outdegree zero 

which computes the constant function 0_ is I By Lemmas 5.17 

and 5.18, this means that at least 2-(n/2)-l = n-1 gates in 

Succ (A,R) may be eliminated from R. (where the only node in 

Z D being accounted for is I ). In addition, since 

R A r 

outdeg(J ,R«) = 0, J r also may be eliminated since it is no longer 

an output gate in R.. Since J /Succ (A,R ), it was not eliminated 

m i n 

previously, so at least n gates may be eliminated from R.. 

In a similar fashion one may count at least n gates eliminated 
from R D . 

D 

To show that at least two additional gates may be eliminated 
from R. or R R we must look at other gates in the circuit. Let 
Pred(I r ,R n ) = {K A ,L A > and Pred(J r ,R n ) = {I^.Lg}. See Figure 5. 21 . 
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Fig. 5.21 The Predecessors 



We consider several cases: 

Case 1(a) : K A and L A are v-gates with outdegree in R n 
more than one;. See Figure 5.2 2. 





Fig. 5.22 Case 1(a) 
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In this case, two additional gates in R. may be eliminated 
by Lemma 5.17. Both . K^ and L. are members of Z R and both 

are of outdegree at least two in HL.. The count above of n 

gates considered only those members of Z D of outdegrfee zero. 

R A 



Case Kb) : Kg and L g are both v-gates of outdegree in R more 
than one. 

If this is the case, then 1n R« two additional gates may 
be eliminated; the proof is handled as in 1(a) with A replaced 
by B. 



Case 2(a) : K. and L. are both v-gates of outdegree one in 
See Figure 5.2 & 




/ 




Fig. 5.23 Case 2(a) 
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In this case, both K. and L« in addition may be eliminated 
from Rg. We have previously eliminated from R g the unique gate 
I in Succ(K.,R ) = Succ(L.,R ). Since neither K. nor L A 
is an output gate in R B , each may also be eliminated from R„. 

Case 2(b) : Kg and Lg are both v-gates of outdegree one in R N - 
Proved similarly to 2(a). 



Case 3(a) : Cases 1 and 2 do not hold; one of {K A ,L A > is an v-gate of out- 
degree more than one 1 n R ; and one of {Kg.Lg} is an 
v-gate of outdegree one in R n . (The other nodes might 
possibly be inputs.) See Figure 5.24-. 





Fig. 5.24 Case 3(a) — One Possibility 
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In this case two additional gates v-^gates may be eliminated 

from R A . Z R has at least one gate of outdegree at least two 
A 

since both K A and L A are members of ; Z R , so Lemma 5.17 allows 
us to eliminate at least one additional gate, the v-gate of 
outdegree one in (Kg.Lg) may also be eliminated since its unique 
successor J r had been previously eliminated and since neither 
Kg and Lg is an output gate in R.. 

Case 3(b) : Same as Case 3(a) with A and B interchanged. 
Proof is similar. 

Casejk One of (K A ,L A ) Is an input, the other is an v-gate of 
outdegree one; one of (Kg*Lg} is an input, and the 
other is an v-gate of outdegree one. 

Suppose w. l.o. g. that K A and Kg are inputs. Let 
Pred(L A ,R n ) ={M,N}. See Figure 5.25. At least one of M or N must 





Fig. 5.25 Case 4 
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be an v-gate, for 1f both M and N were inputs, then 

|PI(I r iR n )l = 3; but |PI(a r )| ■ (n/2) a 4 so this is impossible. 

Suppose w. l.o. g. that M is an v-gate. 

If outdeg(M,R ) s 2, then MeZ D .so in R. one additional 
n R A A 

gate in Succ (M,R ) may be eliminated by Lemma 5.17. Also Lo 

may be eliminated since it is not an output gate and its unique 

successor has been eliminated. On the other hand, if outdeg(M,R ) 

= 1, then in R B both L. and M may be eliminated in turn since 

each of their successors has been eliminated and neither is an 

output gate. 

The above list of cases exhausts all possible arrangements of 
K., L«, Kg, and L,; the other arrangements are not possible in a 
minimal {v}-circuit. For example, it is impossible for K. to be 
an input and L. to be an v-gate of outdegree more than 1 . If 
so then since |PI(I r ,R n )| = n/2, |PI(L A ,R n )| * (n/2) - 1. 
However, the set of prime imp! i cants of any output other than I r 
has at most n/4 < (n/2) -1 variables in common with PI(I r ,R n ), 
and hence the only output in Succ (t»,R ) is I by fact (5) of 
page 145. Similarly, it is not possible for both K. and L^ 
to be inputs. 

This completes the proof of the theorem in the event n is. 
an exact power of 2. 
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If n is not an exact power of 2, let r » {Lag fl and 
let n Q = 2 r . Suppose that R_ 1s a mtiyimal {vHcircutt for 

F n = (a 1 ,b 1 a r +i» b r +i) and "•!** *k anc * J^fce nodes in, R n 

computing a. and b. respect tvely (for ke{l:r+l}). Let B be 
the set of variables {x. | ie[n :n-l3}; and observe that 
PI ( b r+l) = B * If R ' 1s tne circuit obtained by setting all 
variables in B to 0, then R', with outputs {I,»J ! .».'..*I r >J r } » 
is a radix-join network for n Q variables. 

If n f n Q + 1, then by Lemma 5.1 8 each Input x^eB has 
outdegree at least two in R . Hence by Lemma 5.17 at least 
2|B|-1 gates in Succ (B,R ) may be eliminatedfrom R' (since 

^r+1 ^"R" ^ s of outde 9 ree zero). In addition, *f+p which is an 

v-gate since | PI(a^.-, )| > 1, may also be eliminated from R' 

since it is an v-gate of outdegree zero which is not an output of 

R" . Since |B| = n - n Q , this means that a total of a t least 2{n - n n ) 

gates may be eliminated from R'. 

If n ■ n + 1, then PUb^) ■ {x^}- Since 

x ^PlCa-j) and |PI(a,)| > 1, we know that putdeg(x R ) * 1. 

Since N is minimal, J^, 1s the input node X n .^> The setting 

of x , to results in a circuit R' from which at least one 
n-l 

gate in Succ (x n _-j ,R n ) may be eliminated (since there are no. 
gates of outdegree zero in Z R J. Moreover, 1^, is again a gate 
and may be eliminated from R' in addition to the gate previously 
mentioned, giving a total of two gates which may be eliminated from R'. 
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In either case, R' may be simplified to be a radix-join network 
R" on n Q variables with the elimination of 2(n-n ) v-gates. 



Thus 



^C {v} (F n 0) + 2(n-n ) 



s 2n Q + 2 log n Q - 8 + 2(n-n Q ) (since 

n Q is a power of 2) 



= 2n + 2r - 8 



and the theorem is proven. 



D Theorem 5.1 9. 



5. 20 Corollary : Suppose that n = 2 r , re IN , r k % and N is an 
M-circuit for I 1 ! with exactly log n a -gates. Then 



C M (N) s 2n+4 log n - 9. 



Section D_ - Ef f i ci ent Circuits for Thresh old Functions 

As mentioned, the proof of theorem 5.14 is a modification of 
a method first discovered by L. Adleman for computing threshold 
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functions. A description of this method, which proves that 

MC(TJJ) <: kn+6(n) for arbitrary fixe* k, 1s to appear, and we sketch the 

technique briefly in this section for "threshold 2". 

To compute T-, the n variables are arranged in a 
WxW square matrix X. Let R be the disjunction of the 
variables of the 1 row of X, and let C. be the disjunction 

th 

of the variables of the j column. Then observe that 



Or? D? 

T£(x r ...,x n ) -T ™ (R r ...,Rr /ri |)v T ™ (^ C W ) 



since each pair of different variables in the matrix differ either 
in their row numbering or their column numbering:. This construction 
give rise to the recurrence relations 



MC(T!!) s 2n - 2 m +1 +'-2MC<T2*^ ) + 1 



and MC(T|) = 1 



since the row and column sums can be constructed with at most 
2n - 2T/n 1+ 1 gates. This method shows that 
MC(t!J) <. 2n + 2^n + 0(n 1/4 ). 

One problem with the above method is that it does not necessarily 
yield an M-circuit for To with the minimum number Togr? of 
A-gates. For example, for n = 8 the resulting circuit has 
4 A-gates. The circuit for F n described in Theorem 5.14 uses 
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the same idea except that the variables are arranged into a 
rectanfle with sides which are an exact power of 2. While the 
method of Theorem 5.14 yields an M-circuit with exactly Tog n 1 
A-gates, it does so at the expence of increasing slightly the 
total number of gates. A close examination of the recurrence 
relationship for C {v} (F n ) shows thai, fh the natation of the 
proof of Theorem 5.14, either r^ s «fi and n 2 s /2n . or 
n 1 s »^n and n 2 z Wn. Hence this method yields ah M-cfrcuit 



for T 2 with at most 2n+(l+^) sffi +4^-)'^gftes." " '(lior some 
values of n the recurrence fs^better^ The table below describes 
the best known constructions for some small values of n. 



n 


Cor. 5.20 Lower 
Bound if log n 
A-gates 
(Total Gates) 


Construction of 
Thins. 5. 12 and 5.14 

; (TotaT) (Abates) 


Adleman's 
Construction 

{Total) (A-Gates) 


2 


-1 


1 1 


-1 1 


4 


7 


7 2 


7 2 


8 


19 


19 3 


19 4 


16 


39 


39 4 


39 4 


32 


75 


79 5 


79 6 


64 


143 


151 6 


151 6 


128 


275 


291 7 


290 8 



Table 5.1 The Complexity of T« 
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Observe that for n ■ 128, the most efficient known M- circuit 
for T^ does not use the minimal number ( log 128 ) of A-gates. 

Section E. Open Questions 

There are a host of open questions raised by the above 
results: 

(1) We know that C {v} (F n ) s 2n + 0(/n). Determine 
C^(F n ) more exactly. 

(2) Does an optimal M-circuit for Tjh*^ exactly Tog n~l 
A-gates? 

(3) Is there a minimal M-circuit for t!) which is single-level? 
Is there one for an arbitrary quadratic function? 

(4) Is there an M-circuit of size c?n»los[ n f<"* T J for 
arbitrary k < n where c is independent of k? Is 
there an M-circuit of size ^'log k+lV for TjJ for 
fixed k? 
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CHAPTER 6 
The Complexity of Monotone Functions In Other Bases 

The objective of this chapter is to study whether the complexity 
of a monotone Boolean function can be reduced by usjng a basis other 
than {#,v} in the formulas or circuits considered. That this is 
the case for Boolean circuits and multi-output functions is well-known 
and several examples exist. For instance, Paterson [1975] and Mehlhorn 
and Gal i 1 [1976], extending work originally done by Pratt [19741, have 
shown that the M-circuit complexity of the Boolean matrix 
multiplication of two n*n matrices is exactly 2n^ - n 2 . On the 
other hand, Fischer and Meyer [1971] have shown that Strassen's fast 

integer matrix multiplication algorithm can be used to multiply two 

2 81 
Boolean n*n matrices on a Turing Machine in time 0(n '). Using 

Fischer and Pippenger's results connecting time-bounded Turing Machine 

2 82 
computations and circuits, this means that a circuit with 0(n ) gates 

for matrix multiplication exists in any complete basis . Paul [1976] 
has improved this gap by exhibiting a series of monotone functions 

f€B n,n for which MC(f) = " (r2/ Oogn) 3/2 j but for which C g (f) 
= 0(n log 2 n). 



+ Recall that the complexity over different complete bases is always 
related by a constant factor, hence it does not matter which complete 
basis is used. 
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It remains an open question as to whether such a gap between 

C R and MC exists for a single-output function. Pippenger 

2 
[1976] has shown that if one looks at the monotone function in B n 

with the worst MC complexity among all m.b.f. 's in B , then Its 

MC complexity is no more than a factor of O(log n) larger than 

its C R complexity + . On the other hand, several researchers have 

2 
conjectured that a large, even exponential gap may exist between the 

MC and Mg measures for a specific monotone Boolean function. 

In contrast to this, it is possible to show that there are 
specific m.b.f. 's for which allowing a complete basis in the circuit 
does not allow a saving of more than a constant factor over the 
monotone case. This clearly the case for a function of minimal 
M-complexity which depends on all its arguments, and hence has 
M-circuit complexity of n-1. The situation can occur for more 
complex functions in addition. For example, in Chapter 3,. we 
demonstrated that most quadratic m.b.f. 's of n variables have 
%2 circuit complexity of n{n /log n). Since each quadratic 
m.b.f. can be realized in an M-circult which has 0(n /log n) 



In fact, Pippenger conjectures that among all monotone functions 

the worst case values for MC and for C D are asymptotic, and 

B 2 
offers a reward of $100 for proof of this conjecture. 
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gates, it is not possible to save mows than a constant factor in 
circuit size for most quadratic functions. 

We consider the formula size of functions in the remainder of 
this chapter. Here, the choice of complete basis will have a 
greater effect on complexity as pointed out by the work of Pratt 
[1975] - we consider the bases M, B?» and U. One may make comments 
about the worst-case values for these measures for all monotone 
Boolean functions similar to those voiced for circuit complexities- 
it is the complexity relationship for individual functions which 
concerns us here. We show that this relationship can depend on 
the functions involved. Clearly, there are functions of complexity 
n-1 in all three measures for which no savings can be had. In work 
done jointly with M. Paterson, we do show £hat there are m.b.f. 's 
of M-formula complexity e(n 2 /log n) for which any formula in any 
complete basis must be 6f size nCng/log n), and hence for which 
not more than a constant factor savings may be achieved. On the 
other hand, we exhibit a particular sequence of m.b.f. 's which 
have M- formula complexity e(n ), but for which linear sized formulas 
exist over the basis B«. This sequence is derived as a corollary 

of Khrapchenko's result that any U-formula, for the parity function 

2 
must have size n . While this example affirms that the complete basis B„ 

may allow more succinct expressions for some m.b.f. 's, it does not 

answer the question of whether the addition of negation to the basis 

M allows a more compact expression for some functions. This latter 

question remains open. 
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Section A. A Case, where f Complete tests Doesn't Help 

We will utilize a method introduced by Nec*iporuk [1966] to 
establish a lower bound on the formula size of a particular mono- 
tone function. This method has been used by other authors in ref- 
erence to other problems [Harper and Savage 1972, Paul 1977, and 
Lamagna 197*]. While the technique applies to formulas over arbit- 
rary bases, we restrict our treatment here to the binary basis B 2 - 

Notation : Suppose f«B 1s an arbitrary Boolean function of the 
variables Xf»...«x » and suppose Y six^ I 1«£l:nJ> is some 
subset of variables. Let S(f ,Y) denote the set of all restric- 
tions of f to Y obtained by setting- all remaining variables not 
in Y to be constant; that is, 

Slf.Y) - <f I x . Cx for ^ IV<°»1)"- |Y| >• 

6.1 Lemma: Suppose f,Y, and S are as above. If F 1s any B 2 - 
formula for f, and F contains kV2- occurrences of Y variables, 
then 

|S(f,Y)| slS*" 1 

Proof : We prove this fact by Induction on k. Suppose f,Y,S, 
and k are as in the theorem. 

In the event k=2, let x 1 and x. be the Y-variables occurring 
in F (where possibly i = j). Since no other Y-variables besides 
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x. and x. appear in F, all restrictions f | £ _ + for ^y 
of f to Y are functions of the variables {x. ,x.} . Since there 
are at most 16 such functions, this proves the lemma in this case. 

Now suppose that k > 2, and the lemma is true for all smaller 
values of k. In the tree corresponding to F, there exists a 
subtree of F with some number i of occurrences of Y variables, 
where 2 £ i <. k-1. This is true since the sons of a node in a 
binary tree create a partition of the leaves above the node. 
By passing upward* from the root of F, one may find such a subtree 
since each leaf contains at most 1 occurrence of a Y-variable. 
Thus we can decompose the formula F as 

F(x 1 ,...,x n ) = G(H(x 1 ,...,x n ),x 1 x n ) (1) 

where H has % occurrences of Y-variables and 6 has k-s. occur- 
rences of Y-variables. Let g(z,x-, ,x 2> .. .x ) and h(x, jx^.-. .x n ) 
be Boolean fucntions defined by G and H respectively. 

We may bound the number of restrictions of f to Y by bounding 
the number of restrictions of g and h. Let Y' denote the set 
of variables Yu{z}. Then G has exactly k-£+l < k occurrences 
of Y" variables. By (1), since any restriction of f to Y 



t 

In a formula, as in all circuits, we think of the constant and input 

nodes as being at the top of the formula and arcs directed 

downward. 
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satlsfles 



for xh for x^Y ' for x^Y 



there can be no more restrictions of f to Y than the product 

|S(H,Y)|.-'|3(g.Y-)j. 
By Induction, this implies that 

|s(f,Y)| .is 1 " 1 • le^- 4 * 1 *- 1 -i6 ktol 

and the proof is complete. D Lemma 6.1 

6.2 Theorem (Nec'iporuk) : Suppose . feB is a Boolean function and 
P-,»P 2 »---»P r is a partition of the variables {x. | ie[l:n]} such 
that f depends on some variable in P^ for each ie[l:r]. Then 



Lo (f) * V4 I lof|S(.f,P 1 )|- . 
B 2 j=l J 



Proof : Suppose F is a minimal Bg-formula for f. For each 

ie[l:r], let k^ be the number of occurrences of P^-variables in 

k -1 
F. If ^ * 2, then |S(f ,P 1 ) j s 16 i by Lemma 6.1, so 

k^l * log^lSff.P^I = 1/4 log 2 |^(r.P 1 )|. If k 1 . - 1, then 

|S(f r ,P 1 ) | <; 4, so k 1 a 1/4 log |S(< r ,P 1 ) j ; the case k f - cannot 
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happen since f depends on some variable 1n P.. In any event, 

k. ;> 1/4 log |S(f,P.)|. Since L (F) - ?*., this proves the 
1 1 B 2 i=l 1 

theorem. D Theorem 6.2 



We may now use this theorem to establish lower bounds on the 

complexity of particular m.b.f.'s. Note that if n. = |p. I for 

n • i ' i 

ie[l:r], then |S(f r ,P i ) | s 2 2 1 since this is the number of Boolean 

functions of n i variables, and ISff.^)!* 2 n_n i since this is the 

number of settings of variables outside P.,-, Choosing n-j ~ log n for 

each i gives a partition of the variables Into ~ n/log n sets with a 
possible maximum value of log|S(f,P.)| of n-log n, to yield a 

theoretically possible fi(n 2 /log n) lower bound using Theorem 6.2. 



It is possible to show that the maximal lower bound obtainable from 

p 
the theorem is ft(n /log n). 

Using an example similar to Nec'iporuk's original function, we 
establish the following example: 



6.3 Theorem (Paterson, Bloniarz) : For every n, there is an m.b.f. 
feB n such that 



and 



(A) L R (f) = fi(n Z /log n) 
d 2 



(B) ML(f) = 0(n 2 /log n), 
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so that allowing a complete basts in a formula for f does not 
allow more than a constant factor reduction in size. 



/ m 'N 

Proof : Suppose for the moment that n = l(m+l)/2J where m 
is an odd integer. Note that m divides n. To define f, we 
regard the n input variables as arranged in a (n/m)xm rectangu- 
lar matrix x... For each ie[l:n/m] and j«[]:ig]» we will fix 

«— n — *+ 



n/m 



I 




Fig. 6.1 The Inputs 



a., to be a distinct subset of [l:m] of exactly I (m+l)/2 j 
elements (which 1s possible by our choice of n). Define 
f:{0,l} n ■* {0,1} by 



f( *ij> " iecYn/m] C*ij A < k^l :n/ml < q£ "V 53 

"J 

je[l :m] k^i 



(2) 
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Intuitively, we may think ©f the a^'s' as row patterns— - 
(m+1 )/2 entri es whi ch wi 1 1 be chosen from a row of. jn variables. 
If ke[l:n/m] and q A^ x^ is true, we f win $ay that row k 
satisfies pattern a... Note that this implies that row k has at 
least (m+l)/2 ones in it. The clause 



ke[l:n/m] ^qe9.. x kq> 
k f i 1J 

is one if some row other than the i tn satisfies a... Finally, 
f(x-) = 1 if there is some variable x. . ecttiafl tdl Whose cor- 
responding pattern a., is satisfied by some row other than the 
i . We note in passing that f is monotone by Theorem 2.4. 

To obtain a lower bound using Theorem $.2, consider the rows 
P i = {x.j | j e [l:m]} for ie[l:n/m] to be a partition of the input 
variables. We obtain a lower bound on |s{f,P 1 )|. W.l.o:g. 

assume that 1=1; we show that a large number of settings of the 
remaining variables yield different restricted functions of f 
to P r In particular, suppose c.. for i € [2:n/m], je£l:m] 
are constants such that each row of constants has fewer than 
(m+l)/2 ones in it (that is, |{j | c.. = 1}| < (m+l)/2 for each 
ie[2:n/m]). We show that two different such settings of the 
variables x^ = c^ for ie[2:n/m], j e [l :m] yield different 
restricted functions of the first row. 
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Let % denote the restriction f | ■ „ _ for 1e[2:n/m], 

x ij c 1j 
je[T:m], The first row is the only possible row which can satisfy 

any pattern since each other row has fewer than (ro+l)/2 ones in 
it. Hence the inner conjunction is false for k f 1, and 

f c i c [2:n/m] U ^°ij ^ } 
Minn] 

Now if t f c' are two distinct such constants, w.l.o.g. let i 
and j be indices such that c^ - * and cj» ;« 0. Define an 
input for the first row by 



1 if qco.. 
x lq = { for each qe[l :m] 

if q/a^ 

and note that the row x, satisfies pattern a., and no other. 
Hence f^(x ]q ) = 1 whereas fj'lxj ) = 0, and the two functions 
f c and f c' are different °" the input x 1 . 

Thus |S(f f »P 1 ) I is at least the number of such settings of the 
remaining rows with fewer than (m+l)/2 ones per row. Since there 
are 2 length m binary vectors with fewer than (m+l)/2 ones, 
we obtain 
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|S(f P )| ^ r 2 n>-l-j(n/ni)-l _ gMft/nO-tiH'l 



Hence, by theorem 2, 



(n/m) 
U (f) ^ 1/4 - J [n-(n/ra)-ra+l] 



■ 1/4 C(n 2 /m) - (n 2 /m 2 ) - n + (n/m)]. 



By Stirling's formula, [Knuth 1968, p. 46], 

n = ((m+U/2V~ fiTum -2 n 



2 9 

so m ~ log n. Hence U (f) » a(n /m) * a(n /log n) and the lower 

B 2 

bound is established. 

The upper bound on ML(f) is obtained by expanding and fac- 
toring the definition (2) to get a more compact formula for f. 
Since f is not explicitly defined, we prove that an M- formula of 

size 0(n /log n) exists without actually exhibiting it. Clearly, 

2 
f has an M-formula of size 0(n ), namely (2). 

To get a smaller formula for f, we can distribute the variables 
in (2) 'to get the disjunctive form 

f(x) = i £ [l:n/m] ke[l:n/m] (x ij A qea.. x kq ) " (3) 
je[l :m] k M 1J 
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If we define, for kc[V:.n/m}» 
f k (x) - i 



flY/m] (vN^.^kqV ^ 



l:m] 



then by reversing the indexing in (3) we obtain? 

f(x) - V f (*) 
k«l * 

n/m 
so ML(f) s V ML(f.). To bound ML(f.) for a fixed ke[l:n/m], 
k«l * * 

note that each monome in Pl{f^) consists of the distinct product 
of (m+l)/2 variables from row k (which has m variables in all) 
and a single variable not in this set. Since the number of prime 
implicants of f k far exceeds m, by factoring (4) on the vari- 
ables in row k we obtain a smaller formula for f. . 



6.4 Lemma : Suppose r.sclN , and g«B^ is a function such that 

g(x,,...,x #,,,..#_) - V (y. a A x.) 
1 r I s 1al i ^ j 

where,. for each i, A* is a unique subset of size a of [l:r]. 
Then 



ML(g) s (***) + O - 1. 
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Proof of Lemma : We proceed by Induction on r. If r = 0, then since 
there is a unique subset of size of 0, g(yi) is either y-| 
or the constant function 0. In either case, M4s) * ,1 - (q) + (q) - 1. 

If r >0 -and a » r, then clearly g(x-,,. ..,x r> y ) = 
yi A Xl A...AX r or g = 0, so MtO?) 4 ' **l>* (y)*$^- 
Similarly, if a » 0, then again ML(g) s 1 = (""J 1 ) + (JJ) - 1 . 
Now suppose < a < r. By factoring the definition of g by x r , 



we may express 



g = (v g i) v g 2 ( 5 ) 



where 



9 1 = i^T (y i A A x i) 



jeA.-fr) 



1 



and 



9 2 = u[l:s]^V^V 



and T is the set of indices le[.l:s] such that rd^. Note 
that each subset A i \ {r} for ieT has a-1 members chosen 
from [lir-l], and hence inductively 



*<'i> * Ci> + O - ' 



Similarly, each set Aj for i7T has a members chosen from [l;r-l], 
and hence 
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WL(g 2 ) * (J> + (^ 1 )- V. 



Thus, by the expression (5), 






r 



Induction) 
<rjl) ♦ (^ - l ( U ^g tne fact that (J) = (k-j) + ( k-l } 

[Knuth 1968, p. 54]). 
□ lemma 6.4. 



Now, sinceLemma 6.4 a pplies to f k (*) with r = m and a = (m+T)/2, 
we obtain 



ML(f. ) <; f (m+1 )/2) + ((ffl+l)/?)) - 1 

= ((m+l)/2) + |(m-l)/2\ + Hm+l)/2^ 



3n - 1. 



n/ra 
I(3n-1) = 



,2 /m _ rt/ „_«^2, 



Hence ML(f) <. I (3n-l) = 3nV«n - n/m = OfnVTog n). 
k»l 

To finish the proof, for n not of the form [(nH-U/2/ Cm odd), 

we could choose n Q of this form with (n/4) <. n Q <; n and merely 

ignore all but the first n Q of the n arguments. 

U Theorem 6.3 
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Section B. A Case where B~ Can Help . 



In a counterpoint to the previous case, we present an example 
where the larger complete basis B 2 can allow a reduction in formula 
size of more than a constant factor. The example is a monotone 
cover of the parity function. 



Definition: Suppose f e B n is an arbitrary Boolean function. An 
m.b.f. 9eB 2n is said to be a monotone cever of f if 

f(V' X rt> = 9( x i»7 x i» x 2»~ ,x 2 , "» x nV" ,x n^ 



Example : If P n (x^ , . . . ,x n ) = Kf x 2 " Px n is the parity function, 
then g n defined by 



g n (x 1 ,y 1 ,x 2 ,y 2 x n ,y n ) = [P n (x 1 ,...,x |) )A | A , (x i vy.)]v i V (x.Ay.) 

1*1- i B l 



(6) 



is easily verified to be a monotone cover of P n . 

We claim that the function g n exhibits the properties desired. 
We use the following result of Khrapchenko [1971] which we state 
without proof. 

6.5 Theorem : If ne^N , then 



n 2 < Lyfp") < 2n 2 
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The fact that L^p") < 2n is easily -proved by recursively 
dividing P n (x-,,... ,x n ) as (Xj® ... ©x .„ ) ©{x ■,« +1 ®...©x n ). 

The lower bound is established by a remarkable argument which uses 
the isomorphism between series-parallel contact- networks and U- 
formulas. 

Assuming Theorem 6.5, it is straightforward to show that 

n 2 < ML(g n ) s 2n 2 •► 4n ,....- 

To derive the upper bound, suppose thit F is a minimal U-formula 
for P n . By applying DeMorgam's laws, we may distribute all nega- 
tions in F to the variables without changing the size of the for- 
mula, so we assume w.l.o.g. that F has this property. Let F' 
be the formuTa obtained by replacing all occurrences of -iXj in 
F by the literal y 1 (for all lefhn]). A straightforward 
argument demonstrates that 

n n 

[F' a fa (x.vy i );|v i y i (x i Ay i ) 

is an H-formula for g n (x,,y,,...x ,y n ). 

For the lower bound on ML(g n ), suppose that F(Xj,y.|»... »* n »y n ) 
is any M- formula for g n . By replacing y. by -1 x^ in F for 

each ie[l:n], we obtain a U-formula for P n , which by Theorem 6.5 

2 
must have n occurrences of literals. Thus F must have had 

2 
size n . 
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Since P n has a B, formula of size n-1 , L„ (g n ) <; 5n-l 

2 B 2 



Since g n has 2n variables, we have proven 



6.6 Theorem(Bloniarz, Meyer) : For every nelti there is an jn.b.f. fcB 



such that 



ML(f) = e(n 2 ) 



and 



L R (f) <; 5n/2 +0(1) . 
D 2 



Note : Because of Theorem 6.5, any U-formula for g 
must also have at least n occurrences of variables. Thus this 
example does not answer the question of whether the addition of 
negation to the basis B allows more compact expressions for m.b.f. 's. 



Section C. Open Questions 



(1) Is there any single-output m.b.f. for which 
Ly(f) < ML(f); for which C g (f) < MC(f) ? 

(2) Is there a larger gap than that given in Theorem 6.6 

between ML and L„ ? 

B 2 

(3) Do larger gaps between these measures exist for multi -output 

functions? 
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CHAPTER 7 
A Shortest -Path Algorithm 

This final chapter is concerned with the problem of finding 
the shortest distance matrix for a non-negati vely weighted directed 
graph. Several algorithms to solve this problem have been proposed 

[Dijkstra 1959, Floyd 1962* Spira 1973, Eredman 197S]* the best 

3 1/? 1/3 

worst-case running time being 0(n (loglog n) /; /(log n)'- ) 

[Fredman 1976). However, Spiral algorithm, which has a worst-case 

3 2 2 

time of n(n log n) t has average running time @(n (log o) ). 

Spira's algorithm basically searches for the diodes closest to 
a given source node. By searching along edges WmiMmm Weight first, 
it is likely that the shortest paths from the source node to all 
other nodes will be discovered before all arcs in the graph need to 
be traversed. In fact, starting from any given source, on the 
average only 0(n log n) arcs need to be traversed. By repeating 
this process for all r. source nodes, the shortest path matrix can 
be found. 

In Appendix 3 we show that several lacunae remain in Spira's 
algorithm. In particular, Spira does not specify any search pattern 
when there are paths from the source of equal length. We observe 
that arbitrary choices among paths of equal length do not solve the 
problem efficiently; inappropriate "tie-breaking" rules dan result 
in algorithms which have fl(n ) running time on almost all 

matrices. One example of this phenomenon is presented in Appendix 3. 
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Our main result (this work was done jointly with A. Meyer 

and M. Fischer) is a correct vers ion of Spira's algorithm, which 

2 2 

we prove does indeed run in 0(n (log n) ) average time for a 

broad class of probability distributions on directed graphs with 
MHHnegatlve weights. These distributions properly include the 
distributions for which Spira originally made his claims. Infor* 
mally the condition characterizing distributions for whtch our 
results hold is that Information about the entire graph except 
for the arcs emanating from any given node i, together with s 
information about the weights of the arcs emanating from node 1 » 
is not correlated with the assignment of the weights to those 
arcs — all permutations of assignments are equally likely. 

Finally, a modification of thts algorithm is noted which 
computes the transitive closure of a Boolean matrix in average 
time 0(n log n). This result has recently been Improved by 

2 

Schnorr [1973a]who exhibits an 0(n ) average -time algorithm for 
this problem. 



Section A. Shortest Paths and Graph Distributions 

Directed graphs were defined in Chapter 2. We henceforth 
assume that every n-node graph has a set of vertices V = [l:n]. 
A graph (V,E) is weighted if there 1s a cost function 

c:E -»-F u{-»,«} which assigns a weight or cost to each edge in the 

* 
graph; the graph is non-negatively weighted if range (c) c P . 
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We consider only weighted graphs of this latter type and refer to 
them as weighted graphs. 

One standard representation of a weighted graph is the nxn 
cost - matrix C, where C(i,j) is defined to be the cost of edge 
(i,j) if edge (i,j) € E, and C(i.i) j» ~ if (nl^E 1 ". One 
additional representation will also be used. In the sorted list 
of edges representation, an n-node graph G is represented as a 
sequence of n adjacency lists . Each list element is a pair (a,w), 
where a is a vertex of G and w is a weight in F *. The i th 
list 

L i = ' ((a 1T w nV a l 2 * w 1 2^";< a 1k 1 - w 1k l ) J 

represents the k^ edges emanating in G from node i (including 
possibly an edge from i to itself); a i . is the endpoint of the 
arc and w^ is its corresponding weight. In addition, we 
require that the weights in each adjacency list be arranged in 
increasing order w^ <; w- 2 s ... s w iR ++ . In the case where 



We do not distinguish between missing edges and edges of weight 

tt 

The definition of < and + on p + are extended to r* in 

the usual manner. 



-180- 



there are multiple edges in the adjacency list of the same weight, 
we assume that the endpoints are placed in the list 1n some 
prescribed order; for definiteeess we assume that they are in 
order of increasing node index. + 

We wi 1 1 represent the adjacency 1 ists as two nxn matri ces - 
the endpoint matrix A and the edgecost matrfx W, where 



A(i,k) = 

and 

W(i,k) = 



a ik if k s k, 
NIL if k > kj' 



w. k if ksk/ 
if k > k 1 



(NIL is a special symbol )". Figure 7.1 exhibits different represen- 
tations of a particular weighted graph. 

Suppose p =(v r v 2 ,...,v k ) is a path in G, that is, a 
sequence of k*l nodes such that (vj,v 1+l ) € E for ie[l:k-l]. 
The cost of path £ is the sum of the costs of Its edges; that is, 

k-1 

•L c ^ v i' v i+lH where the cost of the trivial path (v) from v to 

v is 0. The minimum cost matrix M of G is the nxn matrix 

which has M(i,j) equal to the minimum of the costs of all paths 

in G from i to j. The shortest distance problem is that of 

computing the minimum cost matrix M given the cost-matrix C of 

a weighted graph. 



This assumption is actually unnecessary for either the correctness or 
the timing of our algorithm. 
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G will denote the set of all n-node weighted graphs. The 

subscript n will usually be omitted. Through the cost-matrix 

* 2 
representation of graphs, we will identify G with [K ] n . 

A map n:G-+ 6 is a row permutation if it simply permutes the 
endpoints of the edges emanati ng from a specified node while leaving 
the remainder of the graph intact; formally, there must be a permu- 
tation p of [l:n] and an index i Q e[l :n] such that, if G is any graph 
in G with cost-matrix C, and 
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C is the cost matrix of n(G), then 



C n (1,3) = 



C(i,j) if 1 f 1 
C(MU)) if i - i 



for every (i,j) e [l:n] . The set of row permutations generates a 
group T of transformations on G ; each member of T^ Is called 
an adjacency transformation . 



If P is a probability measure on G, its distribution function 
P : 



is the function F p :[ R J-* {xeR i Oixsli defined by 



F p (X) =P{C e Gj€(1,J) » w n>r (i»J)«D:n] 2 } 

where X is any element (Xj. | (i,j)e[l:n] ) of [ R ] n . 
Every probability distribution is uniquely characterized by its 
distribution function. 

We will say that a probability measure P on G is adjacency 
invariant (A.I.) if any adjacency transformation ncT n Is a 
measure preserving transformation oa G; that is, F p (n(x)) = F p (x) 
for all xe[ R ] n . We will consider specific examples of contin- 
uous A.I. probability measures later in this chapter. For a 
discrete probability measure on G, note 'that the uieasure is A.I. 
if and only if the probability of any graph is equal to the 
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probability of the graph obtained by permuting the endpoints of 
the edges emanating from an arbitrary node in the graph. 

For any function f:G+F and any probability measure P on G, 
let Ep(f) denote the expected value of f with respect to P. 

Section B. The Algorithm 

Before describing the algorithm, we make an observation about 
weighted graphs which serves as basis for the algorithm. Suppose 
that G is a weighted directed graph, and that i is an arbitrary 
"source" node of G. Define, for any node jeNODES(G), D(j) 
to be the minimum of the costs of all paths in G from i to j, where 
D(j) is » if there is no path from i to j. Let NEAR be a 
subset of the nodes of G such that ieNEAR; those nodes in NEAR 
are called near nodes , and all other nodes are called far nodes . 
For each near node j, we define REALK(j) to be the far node 
k of G such that C(j,k), the cost of the edge from j to k, is 
minimal among the costs of any edge from j to a far node. If j 
has several edges of equal minimal cost emanating from it, then 
REALK(j) is arbitrarily selected to be one of them. REALK(j) 
is undefined if j has no arcs to any far node. 

We make the following observation. 

7.1 Lemma : Suppose that G is a directed graph, and that NEAR 
satisfies the property that for every jeNEAR and MNEAR, 
D(j) ^ D(k). Let ,i„ be a near node such that the sum 
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D(Jq) + C(o .» REALK(j Q ))is minimal among all near nodes, and let 
k Q = REALK(J ). Then 

(D D(k Q ) = D(j Q ) + C(j ,k ), 
and (2) D(k Q ) < D(k) for every MNEAR. 

Proof: Part(l) . Since D(j Q ) is the cost of some path from 
i to j Q , we know that D(j Q ) + C(J ,k Q ) is the cost of a path 
from i to k Q , and hence D(k Q ) £ D(j Q ) + C(J ,k Q ). Suppose to 
the contrary that D(k Q ) < D(j Q ) + CUg.kg), and let 

V" v l» v 2»v"» v t " k o be a patn of weight DfkJ from i to k 
Observe that i > 1 since ieNEAR. Let t Q be the least index in 
[1:1] such that v. ^NEAR; observe that 1 < ju s s! . Thus 
v ,eNEAR, and hence 

o ■■■■■■;... 

^V 1 ' + c( Vr REALIC( Vi > * D( V l} + c( V r V (by - def of REALK) 

V 1 
* ^ C(v m* v m+1 ) (since v r ...,v 



m=l 



■ D(k ) 

<D(J ) +C(j ,k ). 



V 

is a path to v. ,) 
*0~ 
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But this latter inequality contradicts our choice of j Q and 
part (1) is proven. 

Part (2) . Suppose to the contrary that k is a far node such 

that D(k) < D(k Q ). Let i = w ] ,w 2 w^ = k be a path of 

length D(k) from i to k. Again choosing ** to be the least 

index in [l:*] such that w. ^NEAR, we must have 1 < z n <, a 

x, Q u 

and w -.eNEAR. Thus 
V 

D(w _,) + C(w , REALK(w _,)) s D(w _-,) + C(w, ,, w. » 
V * Q *0 £ 0' 



A-l 
5 J 1 C( Wl > 

m-l 
-D(k) 
< D(k Q ) 
- D(J ) •* C(j ,k ), 

which is again a contradiction to our choice of j Q . 

D Lemma 7.1 

By the above argument, if NEAR satisfies the hypotheses 
of Lemma 7.1, then so does NEARu{j Q }. 

We may use Lemma 7.1 as basis for an algorithm for constructing 
the minimum cost matrix M for a graph 6. We first present a 
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version of the algorithm which uses certain simple operations on 
finite sets. We later note how these set operations can be imple- 
mented in terms of standard operations and data structures on random- 
access computers. 

We assume that the graph is presented in its sorted list of 
edges representation as matrices A and W. Extraction of A and 
W from the cost matrix C may be performed by sorting the rows of 
C in a stable fashion using edge weights as keys. The algorithm 
constructs the minimum cost matrix M one row at a time; this cor- 
responds to finding the shortest distance from one particular node, 
say node i, to all other nodes in the graph. The algorithm 
searches from node i in a manner dictated by the weights of the 
edges; shorter paths are searched first. 

In the algorithm, NEAR and D are as described in the hypoth- 
eses of Lemma 7.1 with one exception - D(j) is only defined for 
near nodes. If j is a far node, then D(j) = NIL. Initially 
NEAR is set to {i} and D(i) to 0. The algorithm halts with 
NEAR equal to the set of nodes which are reachable by paths from i. 



A sort is stable if the relative order of elements with keys of 
equal weightis preserved at the end of the sort (cf. Knuth [1973 
Ch. 5 ]). 
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Inductively, to add a new node to NEAR, the algorithm attempts 
to find a node j Q satisfying the hypothesis of Lemma 7.1. A 
priority queue is used to direct this search (see [Aho, Hopcroft, 
and Ullman 1974] for definition). In the course «f execution of 
this algorithm, certain edges on a near node's adjacency list will 
be "examined"; initially no edges in the graph have been examined. 
Inductively we assume that all edges previously examined point 
to near nodes, and that all examined edges on an adjacency 
list will consist of those edges of least cost. A pointer P(j) 
is maintained for each node j in [ltn]; its value is the index 
in matrices A and W of the next (>east-cost)> unexamined edge 
on j's adjacency list. P(j) is initially set to 1 for all nodes. 

Each near node j has an additional value associated with it, 
which we will call KEY(j). For these nodes, KEY(j) is the sum 
of D(j) and the weight of the next unexamined edge from node j; 
that is, KEY(j) = D(j) + W(j,P(j)). Since the examined edges on 
j's adjacency list all point to near nodes, we know that 
KEY(j) ^ D(j) +C(j, REALK(j)). 

To add a new vertex to NEAR, the algorithm selects a near node 
j such that KEY(j) is minimal among all near nodes. Suppose 
that k = A(j,P(j)) (the endpoint of the least -cost unexamined edge 
from node j) is a far node. Then j « j Q satisfies the conditions 
of Lemma 7.1 since 
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D(j) + C(j.k) « KEYU) * «EY'(j") 

s D(j') "+ C(j;, REALK(j')) 

for any other near node j', and since C{j,k) is the least cost of any 
edge from j to a far node( all examined edges point to near nodes). 
Hence k can be added to NEAR and D(k) can be set to KE¥(j) 
and still preserve the inductive hypotheses of Lemma 7.1. 

On the other hand, if node k were already a member of NEAR, 
then its shortest distance is already known, so nothing is done. 
In either case, since the edge from j to * * A(j,Pfj)) has been 
examined, the pointer P(j) is moved to the next edge from $. 

This search is continued until either all nodes are found to 
be in NEAR or until there are no further edges to examine, in 
which case it follows from lemma 7*1 that NEAR is the set of all 
vertices which are reachable by paths from 1(s1nae a missing edge has weight-), 

One additional requirement is imposed in order that the general 
algorithm given above have fast running time. If the node j in 
NEAR from which we are searching has several edges of equal weight, 
then these edges are considered in consecutive executions of the 
search loop. That is, if W(j,P(j)+T) = W(j,P(j.) ), then we require 
that the algorithm select j as the near node from which to search in 
the next execution of the search loop + . 



Together with the specifications on the tie-breaking rules of the 
priority queue below, this requirement avoids the difficulties in 
Spira's algorithm noted in Appendix 3. 
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To select nodes with appropriate KEY values, we use a priority 
queue. Three priority queue operations are necessary. CLEARQ 
sets the queue to the empty state, MINQ returns and removes from 
the queue a pair (KEY(j),j) such that 4 is a node in NEAR for 
which KEY(j) is a minimum among all nodes in NEAR. If the queue 
is empty, MINQ returns the pair (NIL.NIL). INSERTQ (d,j) inserts 
node j into the priority queue and sets KEY(j) to d . In case 
there are several nodes in the queue with minimal KEY values, 
MINQ selects and returns any one of them. (We assume only that the 
queue operates so that the value returned by a MINQ operation 
depends solely on the previous sequence of queue operations.) 

The fully specified algorithm, R i , which computes the shortest 
distance D(j) from node i to node j is given below in a 
PASCAL- like language, where » is represented by NIL: 
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Algorithm R. 
FOR L := 1 to n DO D(L) := NIL : 
FOR L := 1 to n 00 P(L) :« 1; 
NEAR := {1} ; D(i) := 0; J := NIL; CLEARQ ; 
IF W(1,l) f NIL THEN <KEY,J> := <W(1,1),1>; 
Commit If there 1s an edge emanating from 1, then KEY Is set 

to the minimum weight of all edges from 1. 
WHILE J / NIL DO 
Comment J 1s the near node to be searched from. If REALK 1s 

defined for some near node, then J f NIL . 
BEGIN REPEAT K := A(J,P(J)); 

Coment K Is a candidate for REALK(J). 
IF K^NEAR THEN 
Comment K is REALK(J). 

BEGIN NEAR := NEAR u {K}; 
D(K) := KEY; 

lf_ JNEARJ - n THEN HALT; 
IF W(K,1) /NIL THEN INSERTQ ( KEY+W(K,1),K) 
END ; 
P(J) :- P(J) + 1 
UNTIL W(J,P(J)) f W(J,P(J) - 1); 
Commut Examine all equal -weight edges emanating from J 

at the same time. 
JF W(J,P(J)) / NIL THEN INSERTQ ( D(J) + W(J,(P{J)),J); 
<KEY,J> := MINQ 
END 
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To compute the shortest-distance matrix M, algorithm R. is 
run for each ie[l :n]. 



Shortest-distance algorithm 



FOR 


I := i to n 
BEGIN 
R,; 


00 



FOR J := 1 to n DO M(I,J) := D(J) 



END 
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Sectio n C. Analysis of the Alg orithm 

If G is a weighted graph, we will denote by N^G) the 
number of times the test "IF_ MNEAR" is executed by R i on 
graph G; that is, N.(G) is the number of edges of G examined 
by algorithm R.. Note that only a fixed number of operations are 
performed by R. between successive executions of this test, so 
that the total number of operations performed by R. on graph G 
is proportional to N^G) + 1. Thus we try to estimate N..(G). 

7 .2 Theorem : For any ie[l:n] and any adjacency invariant 
probability measure P on G; 

EpC^) & n log e n 

Proof : Suppose that ie[l:n] and that P is an A.I. prob- 
ability measure on G. If n is any adjacency transformation, then 
n is a measure preserving transformation on G. Hence Ep(N.) = 
E p (N.°ir)). Taking the sum of such expectations over all adjacency 
transformations n, we have 



e p 



(if( N i°"» = nlT (E P !N i ""» 



'n 

T 

Formally, an edge from 3 is "examined" when the pointer p(j) 

is set to the next entry past the edge in the adjacency matrix. 
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thus, to prove that E(N.) * n log e n, it suffices to show that 



E D ( J CN 1 o n)/.|f |) *n Tog ft. 

p H 



We will prove the stronger statement that, for any graph GeG, 



[n!r n H i'H /lr n l 



< n loq„ n. * • 

~ e~ 



So suppose that GeG is fixed, and let G^ be the set 
{n(G) | n e T}. An easy argument demonstrates that the left-hand 
side of inequality (1) is simply the average value of 1^ with 
respect to the probability measure P-j which assigns each graph 
in G, equal probability, and allother graphs probability zero. Hence 
we must show that 



E D {HA-s n log e n, (2) 

p l 1 



Proving inequality (2) requires analyzing the average 
behavior of algorithm R. over all inputs GcGj. A convenient way 
to carry out this analysis is to consider a probabilistic algorithm 
Rr which has no inputs. R: is the same as R f except that where 
R. would reference inputs describing a graph G,, Rj generates the 
inputs randomly. Note that every graph in G^ has the same 
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edge-welght matrix W as 6; it 1s only the endpoint matrices A 
which differ among the members of G, . Moreover, each possible 
endpoint matrix is equally likely, modulo the retirement that 
edges of equal weight in a row be arranged in increasing node order. 

Hence, the probabilistic algorithm R? which we construct 
selects at random and with equal probabilities one of these 
possible endpoint matrices and then executes R, on the graph chosen. 
The selection of the endpoint matrix A is not \&$&! initially but 
rather dynamically as R^ references A (which is done in left- 
to-right fashion across the rows of A). At the first reference 
by R i to entry A(J,K) for any J and K in p:n], a value 
of the endpoint A(J,K) is chosen, Jf J has several edges of 
equal weight emanating from it, then at the first reference to 
any of these a selection of the endpoints formal! these edges of 
equal weight is made and entered in the matrix A in theproper 
order. In all cases, by selecting these endpoints from all possible 
unused endpoints in an independent fashion with equal probabilities, 
one guarantees that every graph in G, will be chosen with equal 
probability. 

R? is obtained by replacing every reference A(J,K) in R. 
by the following probabilistic procedure A*(J,K). In this latter 
procedure, C 1s an internal nxn matrix which A' uses to store 
the endpoints which have already been selected; C( J, K) is 
initially set to nil for every J and R in [l:n]. L is 
the number of endpoints to be selected, and SET is a set variable 
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which collects those endpoints. RANDOM is a probabilistic 
procedure which returns an element of [1 :n] independently and 
with equal probability. MIN returns the minimal value of any set. 
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Procedure A* 

a'(J.K): IF C(J,K) f NIL THEN RETURN (C(J,K)) 

ELSE 
BEGIN 

L := |{K'f[l:n] | W(J,K) = W( J, K')}|; Cotrmznt The number of edges 
SET .- jj - to be selected. 

REPEAT 

B := RANDOM ; 

]F B^{C(J,r) J K'e[l:K-l]}. 
THEN SET := SETu{B} 
UNTIL |SET| = L. 

Comment SET is now a random selection of L un-used 

endpoints of edges from J. 
For J? =J to (J + L - 1) DO 
BEGIN 

R := MIN(SET) ; 
SET : = SET-{R} ; 
C(J',K) := R 

END ; 

Conrn&nt These L edges are inserted into the graph 
weight W(J,K) in the proper order. 

RETURN (C(J,K)) 
END ; 
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We make the following obvious remark. 



7.3 Lemma : Let E Q be the average number of times the test "if 
K/NEAR" is executed over all possible runnings of algorithm R* 



Then E Q - E p (N.) 



To analyze the probabilistic algorithm RJ , note that once a 
node becomes a member of NEARuSET, it will become a member of NEAR 
before the next call of procedure MINQ. This is true since 
endpoints of equal-weight edges from J are considered in succes- 
sive passes of the loop of RJ. Hence, if NEARuSET - [l:n] at 
any point in the algorithm, then at most n further executions of 
the test "IF K^NEAR" will occur before R: terminates. 

Now let E be the expected number of times the procedure 
RANDOM is called in the execution of R^ until NEARuSET = [l:n]. 
At any point in a particular execution of Rr, the number of times 
the test "J_F MNEAR" has been executed is at most equal to the number 

of times nodes have been selected by RANDOM since the endpoint of 
every examined edge was at some point returned by RANDOM . Hence 

E p (N.) = EqS E + n . 

But E is nothing but the expected number. of times nodes from 

the set [l:n] must be chosen randomly with repetitions and placed 

in the set NEARuSET until NEARuSET = [l:n]. This number is well 

known to be asymptotic with n log e n [Feller 1968, p. 225] so 

E P (N.) < n log n. 
r-j -1 ~ 3 e 

D Theorem 7.2 . 
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Section D. Implementation 

There are several efficient ways of implementing a priority 
queue on a random access machine (R.A.M.) [Aho, Hopcroft, and 
Ullman 1974]. In particular, one can use a heap and ensure that 
every execution of MINQ and INSERTQ takes 
0(log n) steps, and every CLEARQ takes 0(n) steps. Standard 
implementations allow computing membership in, and cardinality of, 
the set NEAR with fixed cost. Under these implementation assumptions 

the average number of instructive steps required to execute R^ (for any 

2 
ie[l:n]) on a R.A.M. is 0{n log n) for any AvI. probability on G. 

Since the shortest distance problem is that of computing the 

shortest distance matrix M from the cost matrix C, we must 

include the cost of extracting matrices A and W from C. This 

can be done by sorting at a cost of 0(«^ log n) basic steps, and 

we have the following theorem. 

7.4 Theorem : Implemented as above, the shortest distance algorithm 

? 2 
take an average of 0(n log n) basic steps on a R.A.M. over any 

A.I. probability measure on weighted graphs. 

For may applications, one wants not only to compute the mini- 
mum distance between all pairs of nodes i and j, but also to 
find a path in the graph which achieves that minimum cost. A 
simple addition to algorithm R. enables one to retrieve such a 
path. A on dimensional array PATH is introduced and PATH(a) is 
initialized ,3 NIL for every u[l:n]. When a far node k is 
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added to NEAR by virtue of its being at the endpoint of an arc 
from the near node j, then PATH(k) is set to j. This fully 
specified version of algorithm JL is given below: 



Algorithm P 



i 



FOR L 
FOR L 
FOR L 



:= 1 to n DO PATH(L) := NIL; 

1 to n 00 D(L) := NIL; 

1 ton DO P(L) := 1 . 
NEAR := {i}; D(i) := 0; J :- NIL.; CLEARQ; 
PATH(i) = i; 

I£W(i,l) f NJL THEN <KEY,J> := <W(i,l),i>; 
WHILE J f NIL nn 
BEGIN REPEAT K :« A(J,P(J)); 

IF MNEAR THEN 

BEGIN D(K) := KEY; 

PATH(K) :- J; 

NEAR :* NEARu{K}; 

IF | NEAR) ■ n THEN HALT; 

IF W(K,1) f NJ4 THEN INSERTQ( KEY + W(K,1),K) 

END; 
P(J) := P(J)+1 
UNTIL W(J,P(J)) f W(J,P(J) - 1); 

IF W(J,P(J)) f NIL THEN INSERTQ(D(J) + W(J,P(J)),J); 
<KEY»J> :■ MINQ 

END. 
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The paths from all source nodes may be stored 1n the obvious way. 

A straight -forward Inductive proof shows the following: 
Suppose keNEAR at some stage in algorithm P r Then if we define 
the sequence of nodes : 



w, = k 



w A = PATH(W 1-T ) for £ a 1 in N. 
and 1f i Q is the least Index in IN such that w = i, then 



1 = W l ,w i -l , "" w 2 ,v 'l = k 



1s a path of cost D(k) from i to k- 

A final application allows us to modify the algorithm to 
compute the transitive closure of an (un*-we1gfeted) directed graph. 
If G is such a graph, the transitive closure matrix I*(G) is an 
n*n Boolean matrix defined so. that 



I*(G)(i,j) 



1 if there is a path from i to j in G 
otherwise . 



The transitive closure problem is that of computing I*(G) from 
the n*n incidence matrix 1(G), where 



KG)(i,j) = 



1 if there is an arc from i to j 1n G 

otherwise. 
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We may identify an unweighted directed graph G = (V,E) with 
the weighted directed graph G' obtained by assigning weights of 
zero to all edges 1n E and weight <* to all missing edges'. 
We make the observation that the shortest distance matrix entry 
M(i,j) 1s equal to iff I*(f,j) =1. 

Hence, any shortest path algorithm may be used to solve the 
transitive closure problem. For algorfthms R. and P. , since 
all KEY'S are equal to zero, one may impTentent the priority queue 
in such a way that every insertion and deletion from the queue 
can be performed with constant cost. As pointed out in a previous 
paper [Bloniarz, Fischer, and Meyer 1976] this results in a 
transitive closure algorithm with 0(n log n) average time. 



-2&2- 



Section 1 E. A.I. Distributions 

We conclude by presenting several examples of j*d4aceQcy 
invariant probability measures. 

1. For each ie[l:n], suppose P. is a probability measure 

* 
on f . Let P be the probability measure *m G obtained by 

selecting each entry of the weight matrix C(i,~j) independently 

according to distribution Pj , Then P, being the product measure 

generated by the measures Pj» is A.I. This class properly 

includes all distributions claimed in Splra £1-973] . . 

2. Suppose R is some probability measure on an arbitrary 

set B, and f :G ■* B is a function which is invariant under each 

ncT : that is, f » n(G) = f(G) for all n* T and G e & . Then 
n n 

the induced probability measure R »,f. on G is A.I. by construc- 
tion. Some specific examples include cases in which the weights 
of edges might be specified but all choices of endpoints are 
equally likely. For example, in the discrete case, one might 
specify a distribution on the sum of the weights of the edges 
leaving each node, and specify that each graph with the same sums 
be equally likely. Or one might specify a distribution on the 
maximum weight of any edge in the graph and specify that each graph 
with the same maximum be equally likely. 
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Section _F. Open Questions 

1. Schnorr [1978a] has recently exhibited an 0(n 2 ) average- 
time algorithm for transitive closure which improves on the ideas 
above. His algorithm does not construct the paths between all 
pairs of nodes; does a transitive closure algorithm exist which 
does construct the paths (as the algorithm P. does) and which has 
o(n log n) average time? 

2. Is there a larger lower bound to the worst-case running time 
of any shortest-path algorithm than the 8(n 2 ) obtained by adversary 
arguments? Can a lower bound on the average-case running time of 
such algorithms be established? For a related paper, see [Yao, Avis, 
and Rivest 1977, Graham, Yao, and Yao 1978, and Yao and Rivest 1978], 

3. The fastest known algorithm for computing the transitive 

2 81 
closure requires 0(n •'"•"J basic steps in the worst case DFischer 

and Meyer 1971]. Can this algorithm be improved upon?t Can Fredman's 

(worst-case 0(n )) shortest-distance algorithm be improved upon? 

(Fredman [1975] has observed that Spira's algorithm may be 

modified to compute the shortest-distance matrix M with a average total 

number of comparisons of only 0(n 2 log n) but at a significant increase 

in total running time when all operations are considered.) 



t Recently, Pan [1978] has improved on this result and has 
announced (1979) an even further improvement. 
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APPENDIX 1 

In this appendix we s4cet.cn a proof pointed out to the author 
by V. Chvatal + of the lower bound on H(n) stated in Theorem 3.14. 
We use notation as in Chapter 3. 

Theorem : Suppose 0<e<l/8, Then, for arbiUrily large n» there 
is an n-node undirected gi 
Hence H(n) a n-n^'^+l. 



is an n-node undirected graph G such that D(G ) ;> n-n^ '+1. 



Proo f Sketch ; Suppose 0<e<l/8, We will show, the existence of an 
n-node undirected graph G (for arbitrarily large n) for which 
D(G ) £ n-n +1 . First we make some observations. 

E 

Suppose that G is an arbitrary undirected graph, and let 
C = ( n i '• * • '^m) be an exact cover of G by complete bipartite 
graphs G, ,...,G. Suppose G. is a complete k.-A. bipartite 
graph for 1 c [l:m]. Observe that if both k. and i. are at 
least 2 for some i£[l:m], then G must contain a 4-cycle; that 
is, there must be nodes a,b,c, and d in G such that edges 
{a,b}, {b,c}, {c,d}, {d'.a} are in G. Hence if G were a graph 
with no 4-cycles, we may without loss of generality assume that 
each graph G. is a (!,£.) complete bipartite graph. Let v. 



f Private communication, 1977. Recent research byBernond and Chung 
[Bermond 1978] have shown that 



n-n 19/24 + 6 < H (n) < n - log 3 n + 0(1) 



for arbitrary 6>0. 
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be the node in h.(G. ) corresponding to the singleton set 1n G.. 

It is clear that {v. | ie[l:m]} is a node cover of G , 
that is, a subset of the vertices cf 6 such that every edge in 
G contains at least one member of the set. Hence, D(G) equals 
the size of the smallest node cover of G,a$ lonf as. G contains 
no 4-cycles. 

If* 0<e<l/f is arbitrary, then Erdos [1959] has proved the 
existence of an arbitrarily large graph G with n nodes which 
has no 4-cycles, and for which ewery set of at least n " e nodes 

spans at least one edge. Hence any node cover for G can omit 

1-c 1 -fe- 

at most n -1 nodes, so DfG ) > n-n +1. 

e 

Q theorem* 
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APPENDIX 2 



In this appendix we explore several questions in the M-circuit 
complexity of combinations of functions. A general theorem, due to 
Galbiati and Fischer, is presented in which a combination of functions 
f and g is considered in which f and g depend on one variable 
in common . This general proof includes as a special case M. Fischer's 
result that MC(f*g) = MC(f) + MC(g) [Paul 1976]. Finally, we 
remark that these results also apply to the measures MC and M(^ . 
Additional research on combinations of functions which are the dis- 
junction of variables has been reported by Lamagna [1975], Neciporuk 
[1971], and Tarjan [1976] . 

1. Theorem (Galbiati and Fischer) : 

Suppose n,m,k, and JielN, and feB n+1 ^ and geB^-j £ are 
m.b.f. 's. 

^^- f A ^B n+m+1 f k+£ by 

fag(x-| x n ,y ls ...,y m ,z) = (f(x ] ,...,x n ,z),g(y 1 ,...,y m ,z)). 

Then MC(fAg) = MC(f) + MC(g). 



Proof : Clearly MC(fAg) < MC(f) + MC(g); we prove the reverse 



This result has been nresented in [Galbiah and Fischer, 1978] since 
the writing of this paper. 



-208- 



Inequallty. 

Suppose N .is a minimal M-circuit which computes fAg. We 
will say a gate G e Gates(N) is a mixed gate if G depends struc- 
turally on some variable from X = {x-j,...,x n } and some variable 
from Y = {y,,....^}. We show that it is possible to re-structure 
N to eliminate all mixed gates and result in a circuit N' which 
has no more gates than N and which still computes fAg. If this 
is the case, then N consists of two disjoint circuits; one consists 
of Succ*(X,N)u{z} and contains a gate computing f(x,z), and the 
other consists of Succ*(Y,N )u{z} and contains a gate which computes 
g(y,z). Hence the total number of gates in IT (and therefore N) 
is at least MC(f) + MC(g). 

So let MIX = {GeGates(N)|G is a mixed gate} and suppose 
MIX t 0. Then I (MIX) f 0, so we can select an arbitrary gate 
G € I(MIX) and let Pred(G) = {H,J}. Since N is minima? ,' neither 
H nor J is a constant node. We will replace G with another 
gate which is not mixed and yet still have a circuit whteh computes 

fAg. 

Since H and J are not in MIX, but G 1s, either H depends 
solely on Xu{z} and J depends on Yo{z}, or vice versa; suppose 
w.l.o.g. that H depends on Xu{z} and J depends on Yu{z}. 

We first consider the case that G 1s an A-gate. In this case 

PK6.N) c PI(H,N)-PI(J,N) ; 0) 
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that is, every prime implicant of G is a product of a prime impli- 
cant of H and a prime implicant of J. By Lemma 2.8 , since no 
component of f*g has a prime implicant which contains both a var- 
iable from X and one from Y, we may eliminate from PI(G,N) all 
such monomes which contain both a variable from X and a variable 
from Y, and still have a circuit which computes fag. We consider 
seveal cases. 

Case 1 : Neither PI(H,N) nor PI(J,N) contains the monome z. 

In this case, every monome in PI(B,N) contains an X-variable 
and every monome in PI(J,N) contains a Y-variable. Hence, by (1), 
every monome in PI(G,N) contains both an X-variable and a 
Y-variable so G may be replaced by a node which computes the 
function V0, the constant function 0. Since this new network has 
one fewer gate (namely G) than N, this contradicts N's minimality 
and hence Case 1 can not occur. 

Case 2 : z is a member of both PI(H,N) and PI(J,N). 

In this case, z e PI(G,N). By an argument similar to Case 1, 
one can also show that all monomes in PI(G,N) different from z 
contain both an X-variable and a Y-variable and hence may be 
eliminated from PI(G.N) while still leaving a circuit which 
computes fag. Hence G may be eliminated and replaced by the 
input node for the variable z, a contradiction to the minimality of N. 
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Case 3 : z is a member of exactly one of .--'PI(H.K) or PI(J,N). 

Assume w.T.o.g. that zePI(H.N). In this case, every monome 
of PI(H,N) other than z contains an X-var1able, and every 
monome of PI(J,N) contains a Y-variable. Thus every monome of 
PI(6) contains a Y-variable, and those which do not contain an 
X-variable in addition are of the form z«t', where t'ePIfJ) 
(note that t may also contain the liters! z). 

So let N' be the circuit obtained by replacing H in Pred(G.N) 
by the input node for variable z. Then PI(G.N') consists of those 
monomes of PI(G,II) which do not contain both an X^vari able and 
a Y-variable, and we -have a circuit N' which stilt computes 
fAg by Lemma 2.8 . Since G is not mixed in H^ we have reduced 
the number of mixed gates in N by one. 

In the event that G is an v-gate, we use an argument dual 
to the one above to also replace G by a non-mixed gate. Recur- 
sively repeating the above construction for each initial mixed gate, 
we obtain a modified M-circuit for fAg with no mixed gates, and no 
more total number of gates than the original. Hence the theorem is 
proven. Theorem 1. 

2. Corollary (Fischer) : Suppose that feB„ ,, and gcB m , are 
m.b.f's* Then MC(fxg) = MC(f) + MC(g), and any optimal circuit 
for fxg contains no mixed gates. 
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Proof : Define f (x, x *z) = f (x-r , . ... >x ft ) and 

9'(y 1 ..-.»y m .z) = g(y ] y m ). Then MC(f ) = MC(f) and 

MC(g') = MC(g): Also f* Ag' = fxg, so the first part of the 
corollary holds. Moreover since f'Ag* does not depend on variable 
z, cases 2 and 3 of the proof of Theorem 1 can never hold for a 
minimal circuit, and hence no mixed gates can occur. 

D Corollary 2. 

In a similar fashion to Theorem 1, one also proves that 
the following holds. 

3. Theorem : Suppose that feB_ u and g e B are m.b.f. 's. Then 

(1) MC A (fxg) ^ MC A (f) + MC A (g) 
and 

(2) MC y (fxg) = MC y (f) + MC v (g). 

The proof of part (1) is similar to that of Theorem 1; 
one considers an A-minimal circuit for fxg which among all such 
also has a minimal number of v-gates. Part ( 2 ) holds by duality. 

A similar result holds to Theorem 3 holds for fAg. 

Note : We observe that the analogous question to Theorems 1 and 2 
when f and g have 2 variables in common is false as the example 
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f(z 14 z 2 ) = g(z 15 z 2 ) = z-, a z 2 
demonstrates • 



*213- 



APPENDIX 3 



In this appendix we present a version of Spira's algorithm 
for computing shortest distances in a graph [1973] which has 
n{n 3 ) average running time over a certain class of graphs. 

This counter-example, which was previously reported in [Bloniarz, 
Fischer, and Meyer 1976], reveals why we needed to revise, correct, 
and verify Spira's original approach. 

Spira's algorithm operates by computing the distance from a 
particular "source" node i to all other nodes in the graph. By 
repeating the algorithm for each source node i, the shortest- 
distance matrix may be found. Two differences between this algor- 
ithm, which ws call S i , and the algorithm R 1 of Chapter 7 are 
noted. The first is that, in S ] . , no assumptions are made about 
the value returned by MINQ in the case in which there are nodes 
in the queue with equal KEY's; the queue might utilize information 
about the graph in breaking ties. This is a minor difficulty. 
The other difference is that when a near node has several edges 
of equal weight emanating from it, then these edges are not nec- 
essarily examined in successive passes of the algorithm. These in- 
complete specifications lead to the problems we describe later. A 
Pascal-like implementation of S. is given below: 



-214- 



ALGORITHM^ T (,Sfltra's Algorithm) 

FOR L := 1 to n DO D(L) := NIL; 
FOR L := 1 to n DO P(L) := 1; 
NEAR := {i}; D(i) := 0; J :- NR; CLEARQ; 
IF W(i,l)> NIL HO <KEY,J> := <W(iJ),1>; 
WHILE J f NIL DO 
BEGIN K := A(J,P(0)); 
IF MNEAR THEN 

BEGIN NEAR := NEARu{K}; 
D(K) := KEY; 

IF |NEAR| = n THEN HALT; 

IF W(K,1) f NIL THEN INSERTQ(KEY + W(K,1), K) 
END ; 
P(J) :* P(J) +1; 

lf_ W{J,P(J)) f NIL THEN INSERTQ( «Y + W(K,1),K); 
<K£Y,J> := MINQ 
END . 

The proof that S^ correctly computes the shortest distance 
D(j) from node i to node j is identical with that given in 
Chapter 7 for algorithm R.. 

To show that algorithm S^ can be implemented poorly, for 
simplicity we will restrict ourselves to the case in which all 
edge weights are either or ». This corresponds to computing 
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the transitive closure of a graph as mentioned In Chapter 7. 
In this case, every node in the priority queue has a KEY of zero, 
and the value returned by MINQ is an arbitrary pair In the queue. 

We present an example in which the priority queue is implemented 
so that the value returned by MINQ does not utilize any information 
about the graph (i.e. the matrices A ahd W) other than that 
given It by previous INSERTQ operations. In particular, we maintain a 
first-in, first-out queue to store the nodes in the priority queue. A node 
is INSERT' ed at the end of the queue, and MINQ removes and returns 
the node at the beginning of the queue. 

Theorem : Let P be the uniform probability distribution on n-node 
weighted directed graphs in which all edge-weights are zero or «■>. For 
any graph Ge<3 n . assume (as usual) that edges of equal weight in 
all adjacency lists are sorted by Increasing node index. If 
N.j(G) is the number of times the subroutine MINQ is called 
in executing S. on G, then, when implemented as described above, 



EpC^) = n(n 2 ) 



Proof : A straightforward argument shows that an average 
graph under this probability distribution 

(1) has at least n/3 edges emanating from each node 
and (2) has a path (of weight 0) from every node to e^ery other 
node in the gr?ph. 
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It suffices to show that there Is a constant c > such that if 
G 1s a graph satisfying (1) and (2), then t^.($) i cp . 

So suppose G satisfies (1) and 42)* Then, in the execution 
of S^ on G, node n will be placed in NEAR at some point. 
Hence, at some point in the execution we must have K = n; let jg 
be the value of J at the point (the edge from Jq to n 1s 
being "examined" at that point). Since j Q had at least n/3 , 
edges from it, and since the edge to n is the last edge listed 
in the jl row of A and W, we know that j Q oust have been 
placed 1n NEAR, placed on the queue, and subsequently returned by 
NINQ at least n/3 times. 

Following the A time that j« is returned by MINQ, an 
edge emanating from j Q is examined, and its endpoint A(Jq,a) 
is added to the queue (unless that element had previously been 
placed on the queue). The queue discipline implies that A(Jq,ji) 
will be returned by MINQ in between successive returns of in 
by MINQ, starting from the A time that j«; is returned by 
MINQ and continuing at least until either node n is placed in 
NEAR (after at least n/3 - I further returns of j Q by MINQ) or 
until every edge from A(j Q ,i) has been examined. Thus A( j Q ,&) 
is returned by MINQ at least min(n/3 - I - 1, n/3) - n/3 - % - 1 
times. Since this is true at least for 1 s i < n/3, the total 
number of calls to MINQ is at least 
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l (n/3 -£-!)= n(n 2 ) 



n/3-1 

I 
1=1 



and the statement is proved, 



□ Theorem. 



Since this theorem is true for every ie[l:n], a transitive 

closure (or shortest path) algorithm implemented as above will have 

3 
fi(n ) average time, contrary to Spira's original assertion. 
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